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A CROSS-CULTURAL STUDY OF AUTHORITARIANISM* 


Western. Washington State Gollege and North Dakota State University 


Ковккт D. MEADE! AND JAMES О. WHITTAKER 


A. INTRODUCTION 


Adorno et al. (1) report that the personality dimension which they have 
labeled authoritarianism is closely related to family and cultural conditioning 
of the individual. In the development of the California F Scale designed to 
measure this personality factor, quite different results were obtained between 
various subgroups within the American culture. Later, studies by Christie and 
Garcia (2) and Courtney et al. (4), which investigated various other sub- 
cultures in America, also reported significant differences. Minkowich and 
Shaked (7) have found considerable variation in authoritarian tendencies in 
various subcultures in Israel. 

Since these studies have found differences between various ethnic and 
social groups within the confines of a single nation, it is to be expected that 
still greater differences in authoritarianism would be found in cultures which 
show an even greater divergence from any one national standard that may 
be assumed. The few empirical studies that have been undertaken outside the 
United States reveal that such is the case. Prothro and Melikian (8, 9) and 
Melikian (5, 6) report a significantly higher level of authoritarianism in 
college students from various Arabian cultures than is found among a com- 
parable student group in the United States. This effect is reported for both 
Christian as well as Moslem respondents. Caladarci (3), on the other hand, 
reports no differences in authoritarianism between Japanese and Americans 
who are involved in the field of education. 

A strict comparison between the results of these different studies is rendered 
difficult if not impossible because of confounding by many factors other than 
those that are strictly cultural. Differences in ages, education, and occupation 
are notable among these factors. For this reason and also because of a general 
paucity of data from cultural groups outside America, the present study was 
undertaken. 


* Received in the Editorial Office, Provincetown, Massachusetts, on October 26, 
1966, and given special consideration in accordance with our policy for cross-cultural 
research. Copyright, 1967, by The Journal Press. —. esas Р 

1 This атн was CE d while the senior author was visiting Fulbright 
Professor at Balwant Rajput College, Agra University, Agra, Uttar Pradesh, India. 
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Five groups of respondents known on general grounds to be culturally dis- 
parate were chosen purposefully for making comparison with a sixth group 
from the United States. Within each cultural group, specific rather than 
random samples were investigated. These groups were all college students 
and are comparable with groups found to be exercising an increasing role in 
leadership functions in the societies of which they are members. Selection of 
this type of respondent also circumvented the problem of illiteracy prevalent 
in some of the cultures chosen for study. The groups consisted of three from 
Asia and one each from Africa, South America, and the United States. 

The African group was made up of 50 males and 23 females from the 
Mashona people whose traditional home is what is now Rhodesia. These 
people until recently lived under a tribal system in which elders made nearly 
all decisions and where the tribal council and chief were responsible for general 
policy. Most of the respondents had attended missionary schools and some had 
adopted the Christian religion. 

The first Asian group was made up of 40 males and 19 females from the 
Near Eastern Arabian countries of Jordan, Lebanon, Egypt, and Sudan. This 
culture is characterized by strong leadership from the male members of the 
group, as well as strong adherence to religious traditions. Nearly all were 
Moslems, but a few were Christians. 

‘The second Asian group was from the Gangetic Plain of Uttar Pradesh in 
northern India and was made up of 30 males and 24 females. All but three, 
who were from the Sikh religion, were Hindus and there was a representation 
from each of the four major castes. 

The third Asian group was made up of Chinese who live in the New Terri- 
tories of Hong Kong. This group was comprised of 16 males and 46 females. 
Most belonged to the Taoist religion, some were Buddhists, and there were 
a few Christians. Many were refugees from the Chinese mainland. 

The South American group lived in Brazil. The background of these 24 
male and 46 female respondents was primarily Portuguese with some native 
Indian mixture of several generations longstanding. Nearly all were members 
of the Roman Catholic faith. 

The American cultural group consisting of 41 males and 44 females was 
from both public and private colleges in New England and the Pacific North- 
west. Seventy-six per cent were Protestant and 24 per cent were Roman 
Catholic, 

All respondents in these six cultures were regularly enrolled students at- 
tending college on a full-time basis, They were given the California F Scale 
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printed in the language used for instruction in their respective colleges. English 
was used for Americans, Arabs, Chinese, and Africans; Hindi for the Indian 
students; and Portuguese for the Brazilians. Subjects were instructed to com- 
plete the questionnaire by using a seven-point scale for each item which ranged 
from strong agreement to strong disagreement. The questionnaires were ad- 
ministered by a member of the faculty of the institution the students were 
attending and hence the study could not be attributed to an American 
investigator. 
C. RzsuLTS AND Discussion 


None of the mean scores for male and female respondents in the six groups 
differed significantly, and therefore they were combined for purposes of com- 
parison with other groups in Table 1. The most striking result is that all 
five cultures, when compared with American students, show a significantly 
higher degree of authoritarianism. This conclusion is supported statistically 


TABLE 1 
MEAN F-ScALE SCORES FOR Six CULTURAL GROUPS 
Culture Institution N Mean SD 
U.S.A. Trinity College, Connecticut; 
and Western Washington State College [UE m SCRI Td 
India Agra College, Agra, U.P. 54 506 .57 
Rhodesia Umtali College, Umtali; 
and University College, Salisbury 73 491 .62 
Brazil Sao Paulo University, Sao Paulo 70 402 .90 
Arabia American University, Beirut 59 445  .63 
Hong Kong Chung Chi College, Chung Chi 62 461 .54 


when the American data are compared with those from each of the other 
cultures, These results are found in Table 2. All of the values comparing 
Americans with other cultural groups shown in Table 2 justify rejection of 
the null hypothesis well beyond the .01 level of confidence. 


TABLE 2 
STATISTICAL COMPARISON OF Six CULTURAL GROUPS USING t-TEST 

Culture U.S.A. Brazil Arabia Hong Kong Rhodesia 
Brazil 6.62* 
RER 12.90* 3.07* d 

ong Kong 16.11* 4,45* б 
Rhodesia 19.44* 6.85* 4.18* 2.00** 
India 19.00* 7.43* 5.55% 4.09* 1.35 
TL 


** 5 © 05 0l. 
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Secondly, Brazilian students, while significantly higher in authoritarianism 
than American students, are significantly lower than each of the other cultural 
groups. 

Listed in order of increasing authoritarianism, the cultural groups are 
United States, Brazil, Arabia, Hong Kong, Rhodesia, and India. Each of 
these groups, however, does not have a significantly higher score than the 
one preceding it. It is quite clear that, while Indians and Rhodesians do not 
differ from each other significantly, taken together they are more authoritarian 
than all other cultures tested. Students from Hong Kong and Arabia appear 
to constitute a high intermediate degree of authoritarianism and do not differ 
significantly from each other, while at the same time they differ from the other 
groups. 

It should be noted that the conclusions concerning the relative degree of 
authoritarianism in Americans and Arabians corroborates the earlier findings 
of Prothro and Melikian (8, 9). 

No attempt can be made here to analyze how one cultural group has come 
to be more authoritarian than any of the others. Differences in religion, family 
structure, child-rearing practices, education, and type of government, working 
singly or in interaction with others, may provide an explanation, The cultural 
groups which are represented here quite obviously differ with respect to all 
of these factors and many more as well. Further research is needed before 
specific factors contributing to authoritarianism or lack of it in each culture 
studied are isolated. Respondents tested in this investigation, moreover, were 
all college students, and it seems reasonable to expect that they represented 
the more liberal elements of the cultural groups to which they belong, It is 
quite likely that testing other educational strata of these cultures would reveal 
an even higher degree of authoritarianism. Indeed, it is also reasonable to 
expect significant variation among the various subcultures which comprise 
the broader national groups. Such differences suggest the need to investigate 
the role played by authoritarianism in the social relations of these cultures 
and subcultures. 

D. Summary 


The personality dimension, authoritarianism, was investigated in six cul- 
turally disparate groups of college students: Americans, Arabs, Rhodesian 
Africans (Mashona), Chinese, Indians, and Brazilians. Mean scores on the 
California F Scale were significantly lower for Americans than for any of 
the other groups. Brazilians scored significantly lower than all except Ameri- 
cans. Arabians and Chinese showed no difference and together constituted the 
next highest level of authoritarianism. Rhodesians and Indians exhibited no 
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differences and together were highest in authoritarianism. Implications for 
further research in personality, leadership, and social relations employing this 
parameter of personality are discussed, 
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THE INFLUENCE OF CERTAIN DEVELOPMENTAL 
FACTORS IN FOSTERING THE ABILITY TO DIFFERENTIATE 
THE PASSAGE OF TIME* 1 


Harvard University 


Davip ZERN 


A. INTRODUCTION 


An ability to differentiate the passage of time may well be a very complicated 
ability to acquire. Many developmental psychologists, including Piaget (6), 
Goldstone (4), and Sturt (8), emphasize the uniform occurrence of the 
learning. On the other hand, such psychoanalytically oriented theorists as 
Schilder (7) and Eissler (3) stress the importance of various psychological 
and cultural factors in effecting individual variations. Bergler and Roheim 
(2) extend this line of reasoning with the particular hypothesis that inten- 
sive interaction between mother and child early in life will inhibit and dis- 
turb the development of ego mechanisms necessary to cope with established 
time differentiation and leave the individual with a very inadequate sense 
of time. 

Although Bergler and Roheim never make the point explicitly, it seems that 
the relevant dynamic process involved was the "necessity" for the child of 
developing a very keen sense of time as a helpful way of securing mastery 
over the outside world, if and only if maternal indulgence was not forth- 
coming. Then it is primarily not the maternal interaction as such, but rather 
the overall indulgence or lack of it that is most salient in the development of 
a differentiated sense of time. The anecdotal evidence presented by Bergler 
and Roheim does not resolve this issue. Therefore a more critical test of the 
two hypotheses was carried out through a cross-cultural replication, utilizing 
normative data from many different societies. 


B. THE SAMPLE 


A society was included in the primary sample, which will be discussed in 
the paper, if and only if it met the following three requirements: (a) there 


* Received in the Editorial Office, Provincetown, Massachusetts, on November w 
1966, and given special consideration in accordance with our policy for cross-cultura 


research, Copyright, 1967, by The Journal Press. Mie ; i 
2 Thanks ate duc to the Milton Fund of Harvard University, whose grant provided 


the funds for the independent ratings which were carried out on the dependent 
Measure, 
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was sufficient information in the Human Relations Area Files to rate the 
dependent variable (the ability to differentiate the passage of time); (5) 
Barry, Bacon, and Child (1) had rated the society as to general indulgence 
shown to the infant; and (c) each society satisfying the first two criteria 
belonged to an independent linguistic family or subfamily, as rated on the 
Murdock World Ethnographic Sample. The final sample was composed of 
29 societies: five came from Africa, eight from North America, five from 
Oceania, seven from South America, and four from the Asian-Russian land 
mass. 


C. Мвтнор 
1. The Dependent Variable 


In line with the cross-cultural nature of the study, an attempt was made 
to identify and relate many possible evidences of a society's concern with time 
and measurement of its passage. The items selected as of particular relevance 
are the result of a prior investigation by the author. From this first study, it 
was easy to pick out a number of “symptoms” of a society’s concern with time 
as manifested by the way it seeks to measure certain parts of its “life space. "2 
Having this in mind, the following items were chosen as significant: (a) 
a developed calendar of a year in length (approximately) ; (5) a break-up of 
the month into different segments (like our own wecks, for example) ; (c) a 
naming of particular days of the “weck” or month; (4) a series of dates 
regularly recurring at the same time each year and of a special ritual signifi- 
cance; (e) a recognition and a naming of monthly type splits, regardless of 
the existence of a total yearly calendar; and (f) a crude and general ability 
to differentiate parts of the day. All societies were then rated on the presence 
or absence of each of the six “symptoms” of a society’s concern with time, 
independently of the presence or absence of any of the other variables. Two 
independent raters with no knowledge of the final hypotheses made these 
ratings. A single discussion following the initial rating was sufficient to resolve 
the few disagreements that had occurred (approximately 5 per cent of the 
total ratings). In cases where only one rater felt that there was enough 
material on which to rate the society on the given variable, his rating was 


2 Thanks are due to J. W. M. Witing for suggesting that the time variable was 
no unitary measure and that it would be useful, and more accurate, to delineate differ- 
ent measures of time differentiation and concern—and from there to attempt a Guttman 
scaling of the variables. The latter provides a more objective estimation of the differ- 
ences among the various societies on the measured variable. 

3 A majority of even these disagreements was the result of a misunderstanding of 
the instructions by one of the raters as regarded item (a). 
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nevertheless used, with the exceptions to be noted below in the results of the 
experiment. 

A preliminary examination of the resulting data revealed some inadequacy 
in the materials. It was decided to refine item (f) further by having two 
raters re-examine those societies where some differentiation of parts of the 
day took place. They rated the society on a seven-point scale according to the 
elaboration of that differentiation. A further item (c) was created out of 
those societies that scored at least 11 out of the total of 14 points available, 

It was then possible to construct a Guttman scale utilizing the resulting 
ratings in the order in which they are named above, when the elabora- 
tion of parts of the day (c) is included between (c) and (4). On this basis, 
for example, it could be predicted that a society in which the members had 
à developed calendar would be most likely to have names for days of the week, 
while an accurate prediction in the opposite direction could not be made. The 
percentage of individual items that are correctly reproduced in the constructed 
scale is 88.6 per cent.* More important than the reproducibility of the scale, 
however, is the fact that a natural hierarchy seems to spring out of the scaling, 
providing us with a way of assessing what societies seem to be most concerned 
with time. It makes eminent sense, for example, that a developed calendar 
should be a more important measure (a better predictor item, generally, that 
is) than a simple recognition of months, since the latter is so obviously related 
to the gross natural phenemenon of the moon and its cycles. 


2. The Independent Variable 


As an independent assessment of the child-rearing variable, it was decided 
to use a rating scale devised by Bacon, Barry, and Child (1) and cited by 
Whiting (9). The appropriateness of the scale can best be demonstrated by 
an extensive quotation from the rating instructions for the scale: 


The initial period will be defined as approximately the first year (and 
approximately as long thereafter as the treatment of the infant remains 
constant, but if there is any change, it is the first year that is dealt with). 
This means that if there is a change in treatment at the end of the 
neonatal period, the treatment characteristic of the rest of infancy is 
what will be dealt with here, not the treatment of the neonate. 
Separate ratings should be made of as many as possible of the following 
variables: 
(1) Display of affection toward the infant; to what extent is he held, 
fondled, caressed, played with? 

—— 

* For further information on criterion for and attributes of a Guttman scale, see 
Green (5, pp. 351-358). 
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(2) Protection from environmental discomforts: e.g., excessive heat or 
cold, bright light, insects. To what extent are such discomforts either not 
experienced, usually prevented, or quickly eliminated? 

(3) Degree of drive reduction: considering particularly hunger, thirst, 
and unidentified discomforts, how fully are the infants needs reduced? 
(4) Immediacy of drive reduction: how quickly are these needs reduced? 
(5) Consistency of drive reduction: how consistently are drives reduced? 
Here it is necessary to take into account not only drives which parents 
can always reduce (e.g, in most cases hunger), but also drives which 
parents may be unable to reduce (e.g, chronic discomfort from illness). 
(6) Constancy of presence of nurturant agent: if the mother is not always 
present, the rating of constancy will then be affected by whether the infant 
is left alone, with a moderately nurturant substitute, a highly nurturant 
substitute, etc. 

(7) Absence of pain inflicted by the nurturant agent: e.g, cold baths, 
depilation, rough handling, or physical punishment. 

A rating is then made of overall indulgence during infancy, based on the 
separate variables listed above, weighted as seems proper, and on any 
other information considered pertinent including general statements about 
infantile indulgence. The duration of the initial period should be given 
some slight weight here (1). 


As can be inferred from the theoretical discussion in the Introduction, most 
weight was placed on the rating described as "overall indulgence" (hereafter 
described as variable No. 8) and on variable No. 6, although mention will be 
made of the results of all the correlations. It is important to note that, despite 
the instructions for independent rating, extremely high intercorrelations among 
the eight variables resulted (generally in the high .80's). Thus, it seems fair 
to say that the scales have all measured the same basic dimension in child- 
rearing with some slight modifications. Alternatively, we could argue, with 
the same effect, that parents are consistent along these differing realms of 
behavior. 
D. RESULTS 


Using the rankings as established by the Guttman scaling made it possible 
to give a numerical score to each of the societies involved in the study cor- 
responding to the rank of the most salient predictor item that they possessed. 
In those few cases where the best predictor item had been ascertained by only 
one of the raters, with the second holding the information not ascertainable, 
a third rater was called in to rate the society on each of the variables in the 
study. Only if she agreed that the society was ratable on the critical measure 
was the score given. If she did not agree, the item was marked “not ascertain- 
able," and the society was given the next lower number for which there was 
agreement as to the presence of the time "symptoms." To be given a score of 
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“1,” indicating that the society had no items demonstrating a time concern, 
it was necessary that clear evidence be found against the presence of at least 
five out of the seven items, with the proviso that the rest were not ascertainable. 

It was arbitrarily decided to give the best predictor item (developed 
calendar) the highest score—an “8”—and so on down to where a society 
having no "symptoms" of time concern would get a score of “1.” The com- 
plete ratings of societies with respect to time differentiation were based on 
the following order: 8—a developed calendar of a "year" in length, 7—a 
break-up of the month into different segments, 6—a naming of the particular 
days of the week or month, 5—an elaborated differentiation of parts of the 
day, 4—a series of dates each year of ritual significance, 3—a recognition and 
naming of different months, 2—a crude and general differentiation of parts 
of the day, 1—the absence of any of the above. Each society is given the largest 
possible numerical rating consistent with its own particular set of "symptoms". 
of time concern. Thus, for example, the Yoruba were given a score of "7," 
indicating that they did break up the month into different segments, but had 
no developed calendar. Similarly, the Aranda received a score of only “2” 
because they did not have any better predictor of time concern than a crude 
differentiation of the part of the day.* 

The result of the correlations will be found in Table 1, but some analysis 
and interpretation of the various results is plainly in order. A certain con- 


TABLE 1 
CORRELATIONS OF CHILD-REARING VARIABLES WITH TIME CONCERN 
Number of . 
Variable societies Correlation 
1. Display of affection 29 E 
2. Protection from discomforts 26 TED 
3. Degree of drive reduction 28 —.305 
4. Immediacy of drive reduction 24 218 
5. Consistency of drive reduction 23 —.328 
6. Constant presence of 
nurturant agent 28 —:224 
7. Absence of pain inflicted Ы 
by nurturant agent 24 me 
8. Overall infant indulgence 29 —.438) 
оре 104! 
“p< .01. 
EAE 10. 


sistency may be expected їп the findings as an artifact of the high correlation 
of the various measures of indulgence. Thus, while there are highly satisfactory 
eS Е 

5 See the Appendix for a complete listing of each society’s ratings on independent 
and dependent measures. 
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results, the fact that all relationships are in the proper direction is no guarantee 
of eight independent confirmations of the hypothesis. It is quite possible that 
there may be only one dimension involved in the eight measures of the child- 
rearing variable. Nevertheless, a close examination of the findings does reveal 
some relevant differences in the correlations that may be of interest. 


The highest correlation of all exists between the time measure and what 
the creators of the scales themselves intended to be the best measure of the 
independent variable. This indicates that it is general indulgence of the infant 
(rather than any particular aspect of that indulgence) that is most important 
as a predictive factor. Of especial relevance is the inadequacy of variable 6, 
which comes closest to the description offered by Bergler and Roheim, to 
account for the establishment of a concern for time in the culture. In this 
particular connection, also, it is relevant to note that the second best predictive 
variable—7—concerns the nurturant agent, but only insofar as it either 
frustrates, or fails to frustrate, the child; here, the mere presence is irrelevant. 


E. Discussion 


Thus, despite the acknowledged inadequacy of correlations to establish any 
causal connections, the results of these particular sets of data go a long way 
toward refining the exact type of interaction that accompanies a differentiated 
cultural sense of time: a lack of overall indulgence in the first year of life 
is crucial, especially as this suggests that little affectionate handling is shown 
the child and some pain is inflicted by the nurturing agent. It makes relatively 
little difference whether the chief nurturing agent is usually present, or just 
how fast discomforts are reduced as long as they are allowed to subsist for 
some time. 

Even if the validity of the major correlations is accepted, there is, of course, 
no such necessity to credit the causal explanations advanced here and by 
Bergler and Roheim. Obviously, other connections must be considered. At 
first glance, it appears difficult to understand how time differentiation could 
cause different kinds of child-rearing, but it is relatively easy to see how both 
variables could be the result of a third factor in a way that would suggest 
an apparently radically different explanation from the one given above. The 
third factor could be designated “cultural complexity.” Then we could say 
that those societies which possess a generally more complex civilization—and 
on that account would almost by definition necessarily have a more developed 
sense of time—have in a sense too many other activities for its members to 
perform to allow the latter to indulge its infants. 

Of course, such a theory is possible, although manifestly difficult to confirm, 
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because it would be very hard to secure an adequate measure of complexity 
which would be acceptable to all. It should, more importantly, be clear that 
even if the correlations of complexity (measured satisfactorily) with child- 
rearing variables should be high, this does not produce sufficient justification 
for the abandonment of the larger implications of the presented hypothesis. 
It is perfectly consistent with—indeed, may well be implied by—an extension 
of the Bergler and Roheim suggestion, that an infant negligently reared be 
motivated to organize his whole world, not just his time perceptions. In this 
case, the direction of the influence would stay the same as that one originally 
proposed. Still, it is generally important to consider the relevance of such 
new data for the additional refinement it would bring to an understanding 
of the relationship between the main variables. 

Similar types of issues are raised at an individual, as well as a cultural, 
level of analysis, and at least bear mentioning. At this point, it is impossible 
to say whether time is unique as a dependent variable, or whether the findings 
in this area could extend to other aspects of the individual’s life space. That 
is, does early frustration "force" the child to structure his world spatially 
as well as temporally in order to achieve gratification better? Or are the 
findings limited to perception of time and its differentiation at the individual 
or cultural level? Only future research will be capable of dealing adequately 
with such questions. 


F. SUMMARY 


A cross-cultural study of 29 linguistically independent societies is utilized 
in order to determine the effect of maternal interaction or overall indulgence 
of the infant during its first year of life on the adult’s differentiation of the 
Passage of time. The measurement of the dependent variable is a Guttman 
scaling of various calendrical devices of the society, such as developed calendar 
and elaborate differentiation of parts of the day. The results argue for the 
primary importance of overall indulgence, with maternal interaction (as a 
separate phenomenon) clearly in a secondary role, in the production of an 
undifferentiated sense of time; similarly, lack of indulgence (the presence of 
frustration) is correlated with a structured ordering of the time dimension. 
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APPENDIX 
TABLE 2 
RATINGS or 29 SOCIETIES ON INDEPENDENT AND DEPENDENT MEASURES 

Society Time 1 2 3 4 5 6 7 8 
Tanala 8 342 38. ‚347,131 33. 15 5 9 
Yakut 8 9 4 7 6 5 9 8 7 
Aleutian 8 5 7 nu — — = 5 $ 
Yoruba 7 11 T H^ 14^*1$ a 11 
Trobriand 7 и оаа I3 =, 12.230 12 
Yahgan 7 12 9. 12 12 10 1$ — 10 
Tepotzlean 7 7 7 10 9 9 310: 8 8 
Marquesans 6 4 4 2 2 2 9 4 2 
Lepcha 5 ш, TE AT Ды? М: 
Siriono 5 12 алат 9 UM 6 10 
Andamans 5 1$..— .12 14. 13$, 14...15 1 
Aymara 5 4 7 10 8 61 10 6 
Araucanians 5 10 10 11 — — 12 Sorat 
Ojibwa 5 1.31 12 11 11 12 9 10 
Thonga 5 $ é 8 10 10 9 4 7 
Wogceo * HM. 1 MI M 1$ 13 
Papago 4 ТҮЙ e HUS IURE 
Comanche E 10 232 ^12. 11^ H 5 120 35 E 
Navajo 4 8 3.1 30 Жү X4 13 9 10 
Chukchee 3 12 9 12 — — uU — 1d 
Azande 3 9 ыш 11 314- is 9 10 
Nuer 3 12 9241312* 12... 12 ЛА 9 11 
Kaska 3 яф} ыб! (фы! Жр. С EA 11 
Omaha 3 Ро "M гүлө 137712 
Cuna 3 1 11 13 — — i 9 12 
Aranda 2 10 12-12) 32 1278 $ 1H 
Lesu 2 14 — 11 10 8 S083. » 11 
Jivaro 1 п 10 — — — 14 — I 
Timbira 1 i = 2 E hurtta ASA 12 
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CONNOTATIONS OF RACIAL CONCEPTS AND COLOR 
NAMES IN GERMANY” 1 


Department of Psychology, Wake Forest College 


Joun E. WirLiAMs AND Dorotuy JEAN CARTER? 


A. INTRODUCTION 


In accordance with a well-established language custom, racial groups are 
commonly designated by a “color-code” in which Caucasians are called white, 
Negroes are referred to as black, Orientals are yellow, American Indians are 
red, and Southeast Asians are brown. Although obviously not providing an 
accurate description of skin color, the color-coding custom seems to be con- 
sidered a convenient and innocuous way to designate racial groups. A problem 
arises, however, from the fact that color names are used symbolically in a 
variety of nonracial contexts and have come to have quite distinctive connota- 
tive meanings (4). Among Americans, the color name black, for example, 
carries the connotations “bad,” “passive,” and "strong," while the color name 
white has the connotations “good,” "active," and "weak." Since color names 
have distinctive meanings, the regular association of a color name with a 
racial group may influence the way the racial group is perceived, an effect sug- 
gested by a study (1) in which nonsense syllables tended to acquire the con- 
notative meanings of color names with which they were associated in a paired- 
associate learning task. 

If the practice of color-coding racial groups does result in the conditioning of 
color connotations, it would be expected that the connotative meaning of a 
given racial name would be more similar to the meaning of the associated 
color name than to the color names used to designate other groups. In a 
study (5) designed to test this hypothesis, separate groups of American Cau- 
casian college students made semantic differential ratings of color names 
(white, black, brown, yellow, red), color-person concepts (white person, 
black person, brown person, yellow person, red person), and ethnic concepts 


———— 


* Received in the Editorial Office, Provincetown, Massachusetts, on November 14, 
1966, and given special consideration in accordance with our policy for cross-cultural 
research. Copyrigh by The Journal Press. 

1 This SUD eH AEST in D by a grant to the first author from the Wake 
Forest College Graduate Council. The authors are indebted to the Fulbright Com- 
Mission, and Professor Kurt Gottschaldt, Director of the Psychology Institute at 
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Now at the Institute for Juvenile Research, Chicago, Illinois. 
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(Caucasian, Negro, Indian-Asiatic, Oriental, Indian-American). When se- 
mantic distance scores were compared, it was found, as predicted, that triads 
of concepts linked by the color code (е.р., Black-Black person-Negro) were 
significantly more similar in connotative meaning than were triads of concepts 
not so linked (e.g., Black-Red person-Oriental). 

The study to be reported here was an attempt to replicate these effects in 
a Caucasian group from outside the United States in order to determine 
whether these findings were specific to American culture and language systems, 
or whether the effects had greater generality within Western culture. 


B. METHOD 


The method of this study closely paralleled the methods of the two previous 
studies (4, 5) where detailed descriptions of materials and procedures may 
be found. 

1. Subjects 


Subjects were 113 Caucasian students of German nationality who were 
students at Georg-August-Universität, Göttingen, West Germany. Ss were 
divided into three research groups: 50 Ss rated 10 color names, 34 Ss rated 
10 color-person concepts, and 29 Ss rated 15 ethnic-national and control con- 
cepts. In each of the groups, the proportions of men and women were approx- 
imately equal. 


2. Research Materials 


Since this study was intended as a cross-national replication of the earlier 
study (5), the first task was the translation of the semantic differential mate- 
rials and the concepts to be studied from English to German. This was accom- 
plished by the second author, who had a good collegiate background in German, 
with the assistance of another American student studying for the doctorate at 
Góttingen, and a German graduate student who had a good familiarity with 
English. In general, translation consensus was not difficult to achieve although 
the translation of the adjective pair "sacred-profane" proved something of a 
problem, since this is not a common polarity in German; “heilig-profan” was 
selected as the best approximation available. The German translation of the 
English concept “Caucasian” posed a different type of problem, since the 
German “‘Kaukasier” is used in the specific ethnological sense (from the Cau- 
casus region), while “Weisser” is used in the general sense of “white person.” 
While both of the German concepts were rated, Weisser was used in the 
analysis as the German equivalent of the English term Caucasian. 

In all, the translators provided German equivalents of 10 common color 
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names, 10 color-person concepts formed by pairing each color name with the 
word "person," 15 ethnic-national and control concepts, the 12 pairs of rating 
adjectives, and the test-booklet face-sheet which provided conventional se- 
mantic differential instructions. Table 1 displays the German-English equiv- 
alents for the 12 adjective rating pairs, while Table 2 displays the German- 
English equivalents of the five color names, the five color-person concepts, and 
the five ethnic concepts employed in testing the principal hypothesis of the 
study. 


TABLE 1 
GERMAN-ENGLISH EQUIVALENTS OF SEMANTIC DIFFERENTIAL RATING SCALES 

Evaluation scale Potency scale Activity scale 
аво stark—schwach langsam—schnell 
(unpleasant—pleasant) (strong—weak) (slow—fast) 
gut—schlecht klein—gross stumpf—scharf 
(good—bad) (small—large) (dull—sharp) 
stinkend—duftend schwer—leicht aktiy—passiv_ 
(foul—fragrant) (heavy—light) (active—passive) 
‘sauber—schmutzig 
(clean—dirty) 
schén—hasslich 
(nice—awful) 
heilig—profan 


(sacred—profane) 


TABLE 2 
GERMAN-ENGLISH EQUIVALENTS OF CONCEPTS 
Color name Color-person concept Ethnic concept 
Weiss—White Weisse person—White person Weisser—Caucasian 
Schwarz—Black Schwarze person—Black person Neger—Negro 
Braun—Brown Braune person—Brown person peces 
stati 
| . Gelb— Yellow Gelbe person—Yellow person Orientale—Oriental 
Rot—Red Rote person—Red person Indianer—Indian 
(American) 


The rating booklet which each $ used was conventional and consisted of the 
face sheet of general instructions, followed by the concepts to be rated. Each 
concept was printed at the top of a separate page, with the 12 seven-point ad- 
jective scales displayed below. For the color-name rating procedure, and for 
the color-person rating procedure, the 10 concepts were presented to Ss in a 
Tandomized order. For the ethnic-national and reference concepts procedure, 
Ss first rated the concept "person," then four reference concepts in random 
order, and then 10 ethnic-national concepts in random order. 


" 
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3. Procedure 


Most of the data were obtained in classroom settings with the remainder 
gathered in small group administrations at various places about the university, 
with the second author, an American Caucasian, serving as administrator. The 
ratings of the 10 color names were obtained first to insure that these ratings 
would be made in a general, nonracial context. After data for 50 Ss had been 
obtained, an additional 34 Ss were located and given the rating procedure for 
the 10 color names. Following this, an additional group of 29 Ss rated the 
15 ethnic-national and control concepts. 


C. RzsuLTS AND Discussion 


The basic data for study consisted of the Evaluation (E), Potency (P), 
and Activity (A) scores for the five race-related color names, the five race- 
related color-person concepts, and the five corresponding ethnic concepts shown 
in Table 2. Each rating sheet was scored by assigning the digits 1 to 7 to the 
seven positions on each rating scale and summing the ratings on the appro- 
priate scales (sec "Table 1) for the E, P, and A factors. Low scores represented 
the “good” end of the E dimension, the “weak” end of the P dimension, and 
the “passive” end of the A dimension. Table 3 displays the three mean factor 
scores for each of the 15 concepts under study. 

As in the earlier study, the D index developed by Osgood et al. (2, p. 89) 


TABLE 3 
MEAN EVALUATION (E), Porency (P), AND Activity (A) SCORES 
oF COLOR, CoLoR-PERsON, AND ETHNIC CONCEPTS 


Variable E р А 
Color 
Weiss 1.96 3.29 4.58 
Schwarz 3.69 6.06 3.70 
Braun 4.85 4.71 2.61 
Gelb 3.60 3.46 4.73 
Rot 2.63 5.43 6.23 
Color-person 
Weisse person 2.46 3.32 4.54 
Schwarze person 4.45 5.28 3.82 
Braune person 5.04 4.46 3.02 
Gelbe person 4.62 3.25 4.76 
Rote person 3.66 4.40 5.48 
Ethnic concept * 
Weisser 3.48 4.40 4.70 
Neger 3.83 4.90 4.16 
Inder 3.44 3.59 3.91 
Orientale 3.93 3.54 3.83 
Indianer 3.59 447 5.34 


Note: Scores shown are mean factor scores divided by number of scales. 
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was used to measure the degree of connotative meaning similarity between 
concepts. D scores were computed between the following: each color name 
and each color-person concept, each color name and each ethnic concept, and 
each color-person concept and each ethnic concept. Table 4 lists the 75 D 
scores obtained in this way. Since a given D score reflects the "distance" be- 
tween the two concepts in three-dimensional “semantic space,” the smaller 
the D score the greater is the connotative meaning similarity between the 
concepts. 

Mann-Whitney U tests (3) were used to test the principal hypothesis of 
the study: namely, that concepts linked by the color code would have greater 
meaning similarity (lower D scores) than concepts not so linked. For each 
third of Table 4, separately, a U test was computed comparing the five D 
scores for color-linked concepts (along the diagonal) with the 20 other 
D scores in that portion of the table. For the comparison of color-name and 
color-person concepts at the upper left, the five color-linked D scores were 
significantly ( < .001) smaller than the 20 noncolor-linked D scores. For 
the color-person and ethnic D scores at the lower right, the color-linked scores 
were again significantly (p < .025) smaller. For the color-name and ethnic 
D scores at the upper right, the color-linked scores were smaller but the 
effect was of only borderline significance (№ < .10). These findings were taken 
as confirmation of the general hypothesis of the study. 

Figure 1 provides a summary of the relative similarity of color-linked and 
noncolor-linked concepts in this study, along with a similar summary from 
the earlier study of American Ss (5). In the left triangles, the distance be- 
tween any two of the vertices represents the mean of the five D scores for 
color-linked ("related") pairs of concepts: i.e., the mean of the five scores 
along each of the three diagonals in Table 4. The distances between vertices 
in the right-hand triangles represent the mean distance between the 20 non- 
color-linked (“unrelated”) comparisons in each third of Table 4. Thus, the 
relative size of the left and right triangles reflects the relative meaning simi- 
larity among color-related and color-unrelated concepts. 

Comparisons of the upper and lower triangles in Figure 1 indicate that the 
purpose of this study was accomplished: namely, the replication of the findings 
from the American Caucasian study with a non-American Caucasian group. 
It would appear that in Germany, as in the United States, racial concepts 
show a substantial meaning similarity to the color names with which they are 
commonly associated. This, in turn, is consistent with the idea that the 
custom of designating racial groups by color names may serve to condition 
the color meanings to the racial groups, thus influencing the way these groups 
are perceived. 
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MEAN SEMANTIC DISTANCES 


AMERICAN CAUCASIAN Ss 


Related Concepts Unrelated Concepts 


(2.73) 


GERMAN CAUCASIAN Ss 


Related Concepts Unrelated Concepts 


(0.96) (2.66) 


FIGURE 1 
MEAN Semantic Distances (D Scores) For CoLor-ReLaTeD TRrans OF Cotor (C), 
CoLor-Person (CP), AND ETHNIC (E) CONCEPTS, AND FOR NONCOLOR- 
RELATED TRIADS OF C, CP, AND E CoNcEPTS 
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D. SUMMARY 


This study of German Caucasian Ss was an attempt to replicate the findings 
of a study of American Caucasian Ss. In the American study, it had been found 
that triads of color, color-person, and ethnic concepts which were linked by 
the custom of “color-coding” racial groups (e.g., Black-Black person-Negro) 
were more similar in connotative meaning than were triads of concepts not 
linked by the color-code (e.g, Black-Red person-Oriental). The replication 
was successful, indicating that the previous findings were not specific to 
American culture and language systems but have greater generality within 
Western culture. The findings from both studies were interpreted as being 
consistent with the hypothesis that the designation of racial groups by color 
names influences the manner in which these groups are perceived. 
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SEX OF THE COMMUNICATOR AS A VARIABLE 
IN SOURCE CREDIBILITY*? 


North Dakota State University and Western. Washington State College 


James О. WurrrAKER AND Ковевт D. MEADE 


A. INTRODUCTION 


While a substantial number of experimental studies of communication and 
persuasion have been performed in the United States, barely a handful of 
cross-cultural investigations have been conducted. In Japan, for example, 
McGinnies (3) has replicated several American studies involving one-sided 
versus two-sided communications, order effects, and so on. In a number of 
African countries, Doob (1) has investigated such problems as the effects of 
the language of communication, the order of arguments, and one-sided versus 
two-sided communications. Finally, in Ecuador, the Agency for International 
Development? has investigated the effectiveness of different communication 
media in generating opinion change. Prior to the project of which the present 
experiment is a part, however, no investigator had ever attempted to compare 
several cultures systematically with respect to a number of variables suspected 
to be significantly involved in communication and persuasion. 

The present experiment focused upon cultural variables related to source 
credibility. Specifically, the problem concerned the credibility of male versus 
female sources where the issue involved was political in nature. The experi- 
ment was conducted in Brazil, Jordan, Rhodesia, India, and Hong Kong. In 
each country the same procedure was followed and the language of the area 
was utilized in both the questionnaires and communication. Thus the experi- 
ment was conducted in Portuguese, Arabic, English, Hindi, and Cantonese, 
respectively. 

B. METHOD 


1. The Issue and Communication 


It is now generally recognized among American investigators that studies 


involving political issues in foreign countries can easily get one into great 
= 

* Received in the Editorial Office, Provincetown, Massachusetts, on November 21, 
1966, and given special consideration in accordance with our policy for cross-cultural 


Tesearch. Copyright, 1967, by The Journal Press. f - 
1 Those Pic ol tia MEE dd in India and Rhodesia were directed by 


Professor Meade. 'The work in Brazil, Jordan, and Hong Kong was directed by Pro- 
fessor Whittaker. 
ersonal communication, 1963. 
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difficulty. At best one's financial support may be withdrawn if such difi- 
culties develop, and at worst one may be prohibited from leaving the country 
involved. Unlike American subjects, and particularly unlike American college 
students, many foreign subjects tend to be easily disturbed by questionnaires 
probing their political attitudes. Consequently, this created a problem in 
terms of selecting an issue to be used in the experiment. Another problem 
in selecting an issue involved the fact that subjects’ opinions are difficult to 
change if these opinions are strongly held, and particularly if the communica- 
tion is brief. A topic was needed which met the following criteria: (a) it 
should be meaningful in all the areas—i.e., subjects could be reasonably 
expected to have an opinion; (b) it should be one on which opinions were 
not likely to be extreme; and (c) it should not be one that would be disturb- 
ing to sensitive subjects. Н 

After some searching the authors settled on the issue of the effectiveness _ 
of the United Nations over the past 20 years. A one-thousand-word communi- 
cation was prepared and entitled "Evaluation of the United Nations Since its 
Founding.” This communication presented a strong favorable opinion of the 
activities of the United Nations, but at the same time it did not indicate that 
the United Nations was a perfect organization. 

In all countries, two tape recordings were prepared by professional radio 
commentators who presented the communication in the appropriate foreign 
language. In each country the communication was presented by a male com- 
municator on one tape and by a female communicator on the other. A differ- 
ent voice introduced the speaker, and a musical introduction was employed 
in the Chinese and Arabic tapes to increase the impression that 
was taken from an actual radio broadcast. 


In Brazil, Jordan, and Hong Kong, the same communication was also 
prepared in written form. In each of these countries, one form had a fictitious 


male "author's" name at the top, while the other had a fictitious female 
"author's" name. 


"To assess the positions of subjects both before and after the communication, 
a 30-item questionnaire was prepared and translated in 
The questionnaire consisted of both pro and con statements concerning the 
UN, with which the subject was asked to agree or disagree—i.e., “The UN 
is weak because it cannot finance its operations” ; “Since the UN has been a 
force for good, it should be strengthened”; “The UN should disband because 
it is a failure.” The same questionnaire w. 
Brazil. In the Brazilian questionnaire th 
statements, and subjects were simply aske 


the recording 


to the various languages. 


as utilized in all countries except 
ere were minor differences in the 
d to indicate "true" or "false" in 


ee 
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connection with each statement. In the other countries, subjects checked one 
of seven positions in connection with each statement, ranging from “strongly 
agree” through “undecided” to “strongly disagree.” 

To assess the subject’s perception of source credibility, a separate question- 
naire was prepared. In this questionnaire three primary items were involved: 
(a) “Do you think that the speaker (or article) was fair and honest, or did 
the facts seem too one-sided?” (5) “Were the conclusions presented justified 
by the facts?” and (c) “Do you feel the speaker (article) did a good or a 
poor job of giving the facts on the accomplishments of the United Nations?” 

In Brazil, Jordan, and Hong Kong subjects were also asked to check one of 

seven positions in connection with the question, “How convincing was the 

speaker (article) ?” These positions ranged from “Very convincing" (scored 
aas “7”) to “Very unconvincing” (scored as “1”). In addition they were also 
asked to check one word from each of nine pair of words which could be used 

» ts 


to describe the speaker (or author)—i.e., “honest-dishonest,” "interesting- 
“uninteresting,” “unfriendly-friendly.” 


2. Procedure 


The initial 30-item questionnaire was presented to college students in their 
classes by the professors teaching the courses. In each case they were told by 
the professor that he was conducting a study of student opinions concerning the 

United Nations, and they were asked to cooperate. Thus, the research was not 
presented as an American project. Each subject was asked to write his class 
in college, and sex, on the questionnaire, but not his name. 

——— One week later the subjects were told, by the same professor, that he was 

| conducting an experiment in the evaluation of different means of communica- 
tion. They were asked to listen to a tape recording of an “actual radio broad- 

- cast” (or to read an “editorial taken from a local newspaper”). Afterward, 
the professor explained they would be asked to answer some questions con- 
cerning the broadcast (or article). Following this they were again given 
the 30-item questionnaire on the United Nations. In connection with this, the 
students were told that not everyone had properly followed directions on 
the first administration. Consequently, a second administration was required. 

In each country one class was used for only part of the study. Thus, in 
each of Brazil, Jordan, and Hong Kong, four groups (classes) were utilized : 
(а) tape-male, (b) tape-female, (c) written-male, and (d) written-female. 
In each of India and Rhodesia, two groups (classes) were utilized : (a) tape- 
male and (5) tape-female. It should be emphasized that the communications 
Presented to all groups were identical in wording. Every attempt was made 


30 JOURNAL OF SOCIAL PSYCHOLOGY 


to match classes in terms of age, sex, and course of study—both within and 
between cultures. 


3. Subjects 


As mentioned above, college or university students were selected as subjects 
in each country. The total number of subjects involved was 549, and this 
number included 208 Chinese, 71 Indians, 64 Rhodesians, 142 Brazilians, and 
64 Jordanians. For the most part, subjects ranged in age between 18 and 30 
years, and they were largely enrolled in liberal arts courses. In terms of sex, 
the groups were roughly equal. 

Subjects in Rhodesia were all Africans enrolled in teacher's colleges, 
while the Jordanian subjects wert high school graduates enrolled in a pre- 
college summer course. The Chinese subjects were enrolled in two colleges 
of the Chinese University of Hong Kong, and the Brazilian subjects were 
attending a State University in Brazil. The Indians were attending a liberal 
arts college in the central part of India. 


C. Resutts 
1. Credibility of the Sources 


The overall results of the credibility questionnaires used with oral presenta- 
tions are given in Table 1. These percentages are based on combined responses 
to the first three items in the questionnaire. Thus 79 per cent of Brazilian 
subjects exposed to the male speaker gave favorable responses to the questions 
(a) “Was the speaker fair and honest or one-sided?” (b) “Were the con- 
clusions justified by the facts?” and (c) “Did the speaker do a good or a poor 
job in giving the facts on the UN?” On these same questions only 25 per cent 
exposed to the female-spoken responded favorably. A chi-square test of the 
difference was significant at the .005 level. For these same subjects, responses 
to the "convincingness scale” tended to reinforce evaluations given in the 
first three items (male-spoken 4.47; female-spoken 3.34), and the percentages 
of favorable adjectives checked for each also revealed a preference for the 
male presentation (93 per cent) over the female (76 per cent). 


TABLE 1 
PERCENTAGE OF FAVORABLE EVALUATIONS OF MALE vs. FEMALE SPEAKERS 


Cultural Male Female 
area N speaker N speaker chi square p value 
Brazil 34 79 31 25 10.3 
Jordan 18 50 11 61 10$ nas 
Hong Kong 40 89 50 79 4.50 <.05 
India . 36 80 35 60 10.69 <.005 
Rhodesia 33 74 31 69 27 n.s. 
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The difference between male and female speakers with Jordanian subjects 
was not significant. In this connection it was noted that initial attitudes 
toward the UN were far less favorable than those held by subjects in the other 
countries involved. For example, 58 per cent of the Arab subjects were 
initially unfavorable as compared with Brazilians (47 per cent), Chinese 
(38 per cent), Indians (37 per cent), and Rhodesians (15 per cent). Thus 
more than half of the Arab subjects initially held positions opposed to that 
presented in the communication. It is possible that, when an audience holds 
a view opposed to that presented in a communication, they tend to pay more 
attention to what is said than to who is saying it. Clearly, additional research 
needs to be done on the interaction between position of the audience relative 
to the communication, and variables related to the source. 

For Chinese subjects there was a significant difference in the perceived 
credibility of the communicators, as shown in Table 1. On the “convincingness 
scale” the score for the male communicator was 5.24 and for the female it 
was 4.84, while the number of favorable adjectives checked was 90 per cent 
for the male and 76 per cent for the female. Similarly, for Indian subjects, 
the male communicator was more favorably evaluated than the female. 
Finally, although there was a tendency for Rhodesian subjects to evaluate 
the male speaker more favorably than the female speaker, the difference was 
Not significant. 

The results of the credibility questionnaires used with written presentations 
are given in Table 2. As in Table 1, these percentages are based on combined 
responses to the first three items. A significant difference was observed in the 
Perceived credibility of male versus female “authors” among Brazilian sub- 
jects, although the difference was not as pronounced as with the speakers. 
The overall impression from comparing “speaker” and “author” data was 
that Brazilian subjects tended to perceive oral presentation as somewhat more 
credible than written presentation, regardless of the sex of the communicator. 
More striking, however, is the difference in the perceived credibility of male 
versus female communicators, regardless of the form in which the material 
is presented. When political or quasipolitical information is presented by a 
male, it apparently tends to be much more readily accepted. 


TABLE 2 
PERCENTAGE OF FAVORABLE EVALUATIONS OF MALE vs. FEMALE AUTHORS 
Cultural 1 Female } 
area N E N author chi square p value 
Brazil 63 22 43 5.38 <.025 
Jordan n 48 17 58 1.01 n.s. 


Hong Kong 38 83 80 84 68 ns, 
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With Jordanian subjects the difference in perceived credibility of male and 
female “authors” was not significant, and the same was true of Chinese sub- 
jects in Hong Kong. 


2. Attitude Change 


Table 3 consists of the mean difference scores on the attitude questionnaire 
administered before and after the oral communication. On the Brazilian ques- 
tionnaire a lower score was indicative of a more favorable change, while on 
the questionnaire used elsewhere a higher score was indicative of a more 
favorable change. It is to be noted that, in every country except Rhodesia, a 
greater change occurred after hearing the male communicator than after 
hearing the female. The only difference which is statistically significant, how- 
ever, is that found for Brazilian subjects. The Chinese and Arab data ap- 
proached significance at the .05 level. 


TABLE 3 
OPINION CHANGE AS A FUNCTION OF SEX ОР THE COMMUNICATOR 
(Oral presentation) 


Mean difference score —— 


Cultural Male _ Female 

area N communica’ N communicator t-test p value 
Brazil 25 9.87 22 9.95 2.30 <.05 
Rhodesia 33 10.17 31 10.25 59 n.s. 
India 36 10.50 35 1042 62 n.s. 
Hong Kong 25 10.31 33 10.15 1.41 П... 
Jordan 13 10,02 11 9.76 1.07 n.s. 

&, 


"Table 4 presents the mean difference scores for the written communication. 
Although none of these differences is significant, they all show greater change 
toward the position of the communicator after reading the artic е supposedly 
written by the male author. R 


TABLE 4 р * 
OPINION CHANGE AS A FUNCTION OF SEX OF THE COMMUNI R 
(Written presentation) 


Mean difference score 


Cultural Male Female 
area N communicator N communicator t-test $ value 
Brazil 36 9.88 16 9.94 1.34 n.s. 
Jordan 13 10.15 7 9.79 1.91 n.s. 
Hong Kong 20 10.38 18 10.25 93 n.s. 


^ t 
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D. Discussion 


Source credibility in communication has been defined by Hovland (2) in 
terms of two aspects—expertness and trustworthiness. Thus a highly credible 
is one perceived as knowledgeable, or at least as being in a position 
to have the facts, and as trustworthy: i.e., having no particular reason to 
falsify or distort the truth. 
Bn... the results of the present experiment it is apparent that male sources 
are perceived as more credible than are female sources in Brazil, Hong Kong, 
т India, but not in Jordan ог Rhodesia.* The data also suggest that the 
ia of communication is important relative to the perceived credibility of 
the source. A male source presenting an oral communication is perceived in 
— "several cultures as significantly more credible than a female, However, with 
D exception, when the communication was written, no differences were ap- 
irent. One possible explanation here is that the sex of the communicator is 
not nearly as apparent in written communication as in oral. In fact, in many 
cases subjects may not even notice whether the communicator is male or female 
when the material is presented in written form. 

Finally, there is a suggestion in the data that the communicator (regardless 
of sex) is perceived as more credible in oral than in written presentation. This 
may be because the subject has no oppo to review points with which 
he disagrees when the communication is . In written communication, on 
“the other hand, he has the opportunity to reread points with which he takes 

La Consequently, the overall credibility of the communication is diminished. 


E. SUMMARY j 


__ The credibility of male versus female communication sources was evaluated 
- among 549 college-age subjects in Brazil, Hong Kong, India, Rhodesia, and 
- Jordan. When the presentation was oral (tape-recorded), male sources were 
perceived as significantly more credible than female sources in. Brazil, India, 
and Hong Kong. In written presentations, only Brazilian subjects perceived 
] _ the male source as significantly higher in credibility. The differences in other 
- countries were not significant. The data also suggest that the communicator 
ылы of sex) is perceived as more credible in oral than in written 
_ presentations. 


E [72-4 
F3 
In a previous experiment in the United States, male sources were found to be 


ificantly more influential in opinion change than were female sources (4). 
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N EXPERI MENTAL STUDY OF LEADERSHIP IN INDIAT! 
Western Washington State College 


Ковкат D. Малов 


А. IwTRODUCTION 
experimental studies of Lippitt (10) and Lippitt and White (11) 
rated that democratic leadership generally produces higher morale, 
aggression, and greater productivity than does authoritarian leadership. 
Shaw (14) found that authoritarian leaders produce a better performance but 
ower morale than do democratic leaders in a sample of college men. His 
tions, however, were over a much shorter period of time than were 
of Lippitt and White. Greer (3) found that people in a neighborhood 
бо are nominated as leaders by their peers tend to be more equalitarian or 
lemocratic than are those not nominated, and Hollander (5) reports а nega- 
ve correlation between authoritarianism and leadership nomination in a mili- 
‘Tt should be noted that all of the studies cited here have used Ss or respon- 
who have grown up in the American culture, which has had a long 


Tian might react differently from people who are more democratic, and a 
t y by Haythorne et al. (4) gives some evidence that this is the case, They 
n a tendency for increased morale, more effective communication, and 
td cooperation when both leaders and followers score high on a test of 
tarianism. An extensive study by Sanford (13) using questionnaire and 

data also suggests different reactions from people who are high in 
tarianism. He reports that adults who score high on scales designed to 

authoritarianism prefer “. . . status ladened leadership, to accept 
directive leadership and to talk in terms of ‘power’ words when cate- 
izing ‘good’ leaders” (13, p. 170). He found such concern with power 
ind strong direction from leaders to be considerably less among people who 
ге low on the same scale. 
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The trends reported by these studies suggest that, in a culture which is 
more strongly authoritarian than that in America, authoritarian leadership 
would produce better morale and productivity than would democratic leader- 
ship, results essentially opposite from those of Lippitt and White. That such 
a culture exists in the Republic of India is evidenced by data reported by 
Meade and Whittaker (12), who found that a sample of college students in 
that country scored significantly higher on the California F Scale than did 
five other national groups. Related to this, a study by Ghei (1) reported that 
both male and female college students score higher on the deference scale of 
the Edwards Personal Preference Schedule than do American students. Socio- 
logical and anthropological accounts of family, religious, and school organi- 
zations have also given adequate documentation for this conclusion [see 
Lewis (9)]. 

Studies of leadership in India have been confined to those employing ques- 
tionnaires [e.g., Govindarajan (2) and Krishnan (8)] or general survey 
methods [e.g., Kothurkar and Pendse (7) and Inamdar and Pendse (6)]. 
The present study was undertaken to determine the effect of authoritarian 
and democratic leadership on productivity and morale in the atmosphere of 
a boys’ club in India. 


B. Мвтнор 
1. Subjects 


Twenty-four grade-school boys between the ages of 10 and 11 living in 
‘a city in northern India acted as Ss. They were recruited from their school 
as volunteers to join a club for the purpose of making a number of useful 
and artistic articles. All Ss were Hindus and the four major caste groupings 
were represented. 


2. Apparatus 


Materials and tools necessary for the construction of kites, maps, and crayon 
drawings were used throughout the progress of this investigation. 


3. Independent and Dependent Variables 


The two conditions of the independent variable were the type of leadership, 
authoritarian or democratic, which was exercised by the leaders in a club-like 
atmosphere. The dependent variables, measures of morale, were the total 
number of absences during the progress of the study, the number of Ss indicat- 
ing a desire to continue with the club activities at the conclusion of each part 
of the study, and expressions of preference for the authoritarian or democratic 
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ader. Productivity was assessed through ranking the quality of work done 
en the projects had been completed. As the study progressed and most 
ps did not complete all of the work in the allotted time, it was possible 


projects, and the total number of man-hours required to complete the 
ects. These latter measures, while not entirely independent of each other, 
ded additional assessments of productivity effectiveness. 


4. Procedure 


For three weeks prior to the beginning of the experiment proper, the two 
third-year college men hired for the project, were coached on the 
ques of both authoritarian and democratic leadership described by Lippitt 
White (11, p. 487). Two observer-assistants were trained at the same 
‘and later were assigned to one of the leaders to observe his behavior as 
ll as that of the participating group members. The chief function of these 
servers was to watch the leaders and determine that they were exercising 
he role appropriately and especially to assure that the democratic atmosphere 
not take on characteristics of laissez-faire leadership also described by 
pitt and White but not tested here. 

During the training period the leaders also worked with another boys' club 
neighboring school to practice each type of leadership under E's direction. 
fore the experiment proper began, four groups of boys with six members 
ch were formed. Each group was made up of boys who had nominated 
other as those with whom they would like to work. Assignment of the 
s to the different experimental conditions was done randomly. 'Two 
one with authoritarian leadership atmosphere and one with democratic 
ship atmosphere, met on Mondays and Wednesdays, while the other two 
ups with a similar leadership arrangement met on Tuesdays and "Thursdays. 
der ^A" exercised authoritarian leadership with the Monday-Wednesday 
1р and democratic leadership with the Tuesday-Thursday group. Leader 
exercised democratic leadership with the Monday-Wednesday group and 
thoritarian leadership with the Tuesday-Thursday group. 
Tt was originally intended that all groups work on three projects, the com- 
of which was estimated to require six weeks although no time limit 
ggested to the boys. In actual practice, three of the four groups required 
tional meetings to complete their projects. Recess, for a week, was observed 
€ the club meetings resumed again. On resumption, all groups changed 
tship in such a way that each one was now operating under a different 
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leader as well as a different leadership atmosphere. As an example, boys who 
had experienced democratic leadership from Leader “A” worked the second 
six-week period under Leader “B” who exercised authoritarian leadership. 
They also worked on a different but similar set of projects. 

Each boy worked with a teammate of his own choosing on three projects 
during each six-week period and on three similar projects during the second 
six-week period, and mutual assistance between members of different teams 
was neither encouraged nor discouraged. In the democratic groups, the leader 
suggested 10 different projects on which the boys could work and encouraged 
them to suggest other tasks that might be done. From the materials available, 
however, only three of them appeared to be feasible at that time and these 
were soon selected. These same projects were arbitrarily assigned to the 
authoritarian groups. The projects were as follows: make a crayon copy of 
a picture of Taj Mahal, construct a flat kite, make a map of the world, make 
a crayon copy of a picture of Red Fort, construct a box kite, and make a 
map of India. Each set of teammates worked on and completed all six projects. 
The complete schedule of leaders, leadership atmosphere, and projects was 
counterbalanced. All four groups experienced both leaders, both atmospheres, 
and construction of all projects. 

Two different clubs met simultaneously for one-hour sessions in the late 
afternoon in two different rooms of the same school. The clubs terminated 
when all sets of teammates had completed all six projects. On termination, 
each 5 was asked by the observer to state under whose direction, Leader “А” 
or Leader “B,” he would like to continue were the activities to resume. 

Finally, five teachers from the school where the clubs met, who prior to 
this time had not witnessed the activities of the clubs, were asked to rank 
each set of six projects in terms of quality. A rank of 1 was to be assigned 
to the sample whose workmanship was considered best and a rank of 12 to 
the one considered poorest. Work was to be judged in terms of neatness, pre- 
cision, and freedom from errors. The average ranking for each of the six 
projects was determined and was used as a qualitative measure of output under 
authoritarian and democratic leadership. 


C. RESULTS 


Data for all groups working in an authoritarian leadership atmosphere were 
pooled, as were the data for all groups working in a democratic atmosphe 
"The total number of absences, number wishing to continue, number of proje 
completed on time, extra days required, and total man-hours required for each 
condition of leadership are presented in Table 1. The number of boys expr 
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TABLE 1 
MEASURES OF MORALE AND Propuctiviry IN DEMOCRATIC 
AND AUTHORITARIAN LEADERSHIP IN INDIA 


Leadership 
effectiveness Leadership atmosphere 
measure Democratic Authoritarian x? 
Total absences 44 15 13.28** 
Number wishing to continue 12 23 2.86 
Projects completed on time 19 31 2.42 
Extra days required 16 6 9.44** 
Total man-hours required 340 287 4,48* 
Preferred leader 3 21 12.04** 
Quality of work 11 25 5.69* 
"r$ < .05. 
ee p< .01. 


ing a preference for the authoritarian and democratic leaders is also presented 
in this table. Finally, the number of projects whose average rank placed them 
qualitatively in the top half of all projects in its class is presented in the last 
line of Table 1 in the category, “quality of work.” 

The statistic x2, corrected for continuity, has been applied to these data 
and is also reported in Table 1. The expected frequencies were determined 
by assuming that their division was equally probable for each type of leadership. 

At the .01 level of confidence, there was a greater rate of absenteeism in 
the democratic than in the authoritarian groups, the boys preferred their 
authoritarian leader over the democratic one, and fewer days were required 
to complete the work under the authoritarian leader. At the .05 level of 
confidence, a greater number of man-hours were required to complete the 
projects in the democratic groups and the work was judged to be of better 
quality in the authoritarian groups. 

While there was a tendency for a greater number of Ss working with the 
authoritarian leadership atmosphere to wish to continue the club activities, 
the difference was not significant. There was also a tendency for the democratic 
groups to finish fewer projects in the allotted time but this, too, was not 
Statistically significant. 

The data for all conditions were also pooled for each leader as an individual 
without respect to the type of leadership atmosphere he was exhibiting. Leader 
“A” was slightly superior to Leader “В” by having more projects completed 
in the allotted time (26 opposed to 24), having more Ss prefer his leadership 
(13 opposed to 11), and quality of work done under his leadership (20 in 
top half as opposed to 16). Leader “B” was slightly superior to Leader “A” 
with respect to number of Ss wishing to continue with the club activities (18 
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vs. 17), fewer absences (26 vs. 33), fewer extra days required to complete 
the projects (8 vs. 10), and total number of man-hours utilized (312 vs. 315). 
When these data were compared with the same statistic, y? corrected for 
continuity, none approached a level of confidence sufficient for rejection of 
the null hypothesis. 

The Lippitt and White study made elaborate report of instances of coopera- 
tion, rivalry, hostility, and aggression exhibited by Ss. No such assessment 
was undertaken in the present investigation. However, when each leader and 
assistant was asked at the end of the study to attempt a general retrospective 
statement relevant to the tendencies toward aggression in each type of 
atmosphere, all four agreed that there was more aggression in the democrat- 
ically led groups than under authoritarian leadership. 


D. Discussion 


Except for the type of projects on which Ss worked and a slight increase 
in group size (six instead of five), the conditions of the present experiment 
were similar to those used by Lippitt and White. However, more reliance was 
placed on quantitative data and no transcripts of conversations among Ss were 
made. Thus, no formal quantitative assessment of tendencies toward harmony 
or discord in either kind of leadership atmosphere could be made. Both the 
leaders and assistants expressed the belief that fewer acts of aggression occurred 
in conditions of authoritarian leadership than under democratic leadership. 

When one considers the quantitative measures, a number of conclusions can 
be made. The number of projects completed in the allotted time, the extra time 
required, and the total number of man-hours utilized indicate that Ss whose 
leadership was authoritarian were the more productive. The rankings of the 
excellence of work done by Ss demonstrated that work done under authori- 
tarian leadership was of a higher quality than that done under democratic 
leadership. That the morale of Ss working under authoritarian leaders was 
higher than that in democratic leadership was indicated by lower rate of 
absenteeism, the greater number wishing to continue in the authoritarian 
conditions, and the number of Ss expressing a preference for the authoritarian 
leader. These results obtained with Ss in India are, then, opposite from those 
of Lippitt and White who used American Ss. 

Lippitt and White explain their results by assuming that leadership effec- А 
tiveness is conditioned to a great extent by the degree to which the followers’ 
individual and group needs are met. They assumed that fewer needs were 
frustrated where the leader allowed a free expression of these needs and their 
modes of satisfaction. Such expression was possible and follow-through more 
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efficient where the leader was democratic. Less satisfaction was attained where 
the leader acted in an authoritarian manner and thus minimized the expression 
or satisfaction of the needs of the group and its members. 

While the data of the present study support the opposite conclusions, there 
need be no alteration in their theoretical explanation. The critical factor here 
appears to be that leadership effectiveness is related to satisfaction of followers’ 
needs and the efficiency with which the leader promotes them. The authori- 
tarian social and family structure typical of India is such that children grow 
up being told what to do and to have decisions made for them. Later, this 
same kind of atmosphere is carried forward into the schools. They have, there- 
fore, had little opportunity to learn to rely on their own initiative or that of 
others their own age which made up the group membership of the present 
study. As a result, when leadership is democratic, few Ss were capable of or 
willing to participate by making suggestions or in decision making. Such a state 
of affairs led to a lowering of both the quantitative as well as the qualitative 
aspects of productivity. Failures, such as these, would be expected to contribute 
as well to a higher level of frustration and a lowering of morale. 

The authoritarian leadership, being similar to that to which Ss were better 
accustomed, led to a more comfortable situation for them. Needs were satisfied 
more adequately and fewer frustrating situations occurred. Being conditioned 
to authoritarian leadership, individuals as well as the group functioned more 
effectively and both the quantity and quality of work remained high. Such an 
atmosphere of accomplishment would be expected to produce the higher level 
of morale which was observed in the authoritarian, as compared to the demo- 
cratic, conditions. 

These results, along with those of Haythorne е? al. (4), demonstrate. that 
cultural and personality differences are important considerations determining 
the effectiveness of the type of leadership atmosphere. Authoritarian depen- 
dency, where it exists, is more likely to be fulfilled where the leader acts in an 
authoritarian way. In individuals and cultures which are less authoritarian, 
such leadership is less effective. 

The present results were obtained from a rather specialized type of situation 
and age group for the purpose of comparability with the Lippitt and White 
study. Generalizations concerning the effectiveness of leadership in other situa- 
tions within the same culture or to the other subcultures, religions, and castes 
of India involve considerable risk, and further study would be required to 
ascertain the relevance of the results of this study for larger group situations, 
such as that found in government. In spite of the authoritarian structure of 
life in India, the long history of authoritarian rule by foreign nations and local 
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monarchs is generally regarded as instrumental in their choosing a democratic 
form of government on achieving independence in 1948. The results presented 
here suggest that a more autocratic form of government might have been more 
satisfactory. However, democratic procedures are now being taught and em- 
phasized in the schools and there is also considerable enthusiasm for participa- 
tion in national and local governments and elections (15). The joint family 
system with its tendency to strong authoritarian leadership is also beginning 
to break down [see Lewis (9) ]. If tendencies such as these continue, follow-up 
studies in future years may very well reveal results that are more congruent 
with those of Lippitt and White. 


E. SuMMARY 


In a boys’ club setting, groups of six 10-year-old Hindu boys living in 
northern India worked with both an authoritarian and a democratic leader. 
Subjects worked for six weeks under each kind of leadership and completed a 
total of six projects. Measures of absenteeism, the wish to continue club activ- 
ities, and expressed preference to continue under one particular leader indicated 
that morale was higher under the authoritarian leadership atmosphere than 
under the democratic leadership atmosphere. Number of projects completed on 
time, extra days required, and total man-hours expended indicated that pro- 
ductivity was higher under the authoritarian leader than under the democratic 
leader. The quality of work done under the authoritarian leader was judged 
to be superior. These findings, opposite from those made on American Ss by 
Lippitt and White, are explained by assuming that, in a culture with a rela- 
tively high level of authoritarianism, needs are more adequately met in an au- 
thoritarian leadership atmosphere than in a democratic leadership atmosphere. 
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SAMOAN INTERPERSONAL VALUES* ! 
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Jarrett W. GARSEE AND ALFRED Е. GLIXMAN 


A. INTRODUCTION 


Anthropologists who have written about Eastern (American) Samoa (2, 
3, 7, 8, 10, 13, 14, 15, 16, 17) are consistent in emphasizing the matai system 
as an integral part of Samoan culture. The matai system is an extended kin- 
ship system which is dominated by a chief (matai). The extended clan func- 
tions as a socioeconomic unit which provides a definition of a member's status 
in the community. As described by various writers, the system is strong, flexible 
(in the sense that social mobility and a relatively large range of specific be- 
haviors are permitted), and highly stable. Individual members are described 
as accepting and following group norms without difficulty, as being helpful 
and generous toward other members of the clan, and as taking a great deal of 
pride in their own accomplishments and in the accomplishments of the clan 
and the metai. Pride, however, is not attached to an individual qua individual, 
but always to an individual as reflecting the worth of the larger group; there- 
fore, there is little concern with seeking individual recognition. Nor is inde- 
pendence of action or thought highly valued. 

The authors’ major interest is in assessing some of the changes that occur in 
the American Samoan culture as a consequence of the widescale infusion of 
western cultural characteristics (9). That such changes are taking place and 
should be expected to take place at an accelerated rate is indicated by the 
expenditure by the United States of over $40,000,000 for new roads, new 
schools, а new power plant, a new jet airstrip and airline terminal building, a 
new hotel, improved sanitation and public health services, and a complex edu- 
cational television system (12). A new hospital, a new telephone system, and 
a new radio station are in the planning stage. The rapidly changing economy, 
and the increased emphasis on universal education (with a consequent bilin- 
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guality of nearly all children) presage rapid cultural changes. In order to 
evaluate these changes—and, possibly, to help make these changes smooth 
ones—it is necessary to have a technique which permits a quantitative evalua- 
tion of change over time. 

Because of the consistency of observational conclusions about roles of соп- 
formity, benevolence, personal recognition, and independence, it is desirable 
that any measurement device that is used should deal with these variables; in 
effect, the accuracy of the technique may be evaluated by comparing its results 
against previous observations. Gordon’s Survey of Interpersonal Values (4, 
5, 6) deals with six value dimensions (Support, Conformity, Recognition, In- 
dependence, Benevolence, and Leadership), four of which seem directly appli- 
cable (Conformity, Recognition, Independence, and Benevolence) to the 
description of Samoan culture. In the main, Samoans may be expected to score 
high on Conformity and Benevolence, and to score low on Recognition and 
Independence. 

Since there was no prior administration of the Survey of Interpersonal 
Values (SIV), any statement about change must be inferred from a comparison 
of different groups taking the SIV at the same time. An older high school 
group would have experienced some of the effects of cultural disruption that 
started with World War II, and would be oriented toward finding a soci 
and vocational place in a new social structure. For this reason and because 
Gordon (6) has presented normative data for high school and college students. 
in Japan and in the United States, high school seniors were chosen to be sub- 
jects in the current study. These subjects can be divided in a variety of ways. 
The ones that seemed most meaningful to the authors were to separate them 
according to academic curriculum and sex. For boys, three programs (Divi- 
sions) are available: Academic, Commercial, and "Technical. Girls may enroll 
in either an Academic or a Commercial Division. For reasons to be explicated 
in the Discussion, the authors expected that there would be value differences 
among the Divisions and that these differences would not be identical for both: 
sexes. Such differences may reflect differential sensitivity to factors affecting. 
cultural change. 

In addition, the authors decided to give the SIV in both English and Samoan 
forms. Value differences across forms would be consistent with expectati 
based upon a consideration of the directive properties of language (19). B 
these differences interact with Division membership, then increased evidence 
for differential sensitivity to culture-change factors would be provided. 

Specifically, then, the purpose of this study is to investigate the order o! 


б 


values as determined by the Survey of Interpersonal Values for a sample ol 
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American Samoan high school students, and to see whether internal evidence 
of culture change can be adduced. 


B. Метнор 


The values of a group of Samoan high school students were studied by 
analyzing their responses to the Gordon Survey of Interpersonal Values. 


1. Subjects 


All 139 students (93 boys, 46 girls) from the senior class of the High 
School of American Samoa were used as the basic pool of individuals from 
which Ss were drawn. The high school program contains three major Divisions 
for Boys (Academic, Commercial, and Technical) and two major Divisions 
for Girls (Academic and Commercial). Within each Division there were two 
classrooms: one was used to test the students in English and the other was 
used to test them in Samoan.? The difficulties involved in long-distance com- 
munication and administration of the test, between Oklahoma and Samoa, 
resulted in all of one classroom group taking either the English or Samoan 
version of the test. The procedure created confounding of classroom and lan- 
guage, and also made later analyses more difficult because of unequal sample 
sizes in the classification cells. Eight boys were eliminated in order to facilitate 
the analysis or because their tests were faulty. 


2. Test 


The Survey of Interpersonal Values (4) endeavors to define the values of 
the individual by those scales on which the individual scores high. An indi- 
vidual who scores low on a scale simply does not value what is defined by that 
particular scale. A full description of the scales (Support, Conformity, Recog- 
nition, Independence, Benevolence, and Leadership) may be found in Gordon 
(5). All tests were scored according to the standard key. 


3. Procedure 


The SIV was administered to the various classes by a member of the 
Counseling Department of the High School of American Samoa? (English) 


2 Throughout this report, “(Samoan)” and “(English)” will be used to indicate 
different language versions of the SIV or the groups taking these different versions. 
he authors wish to express their appreciation to Mr. Dave Chitlock for adminis- 
tering the tests, for supervising the translations, and for coping with all of the on-the- 
spot problems that inevitably arise in long-distance supervision of research. They also 
wish to thank Mr. Ale Va'alele for providing the translation from English to Samoan, 
Mrs. Sa’en Scalan for checking the accuracy of the translation, and Mr, Soli 
Aumoeualogo for providing a “back-translation” from Samoan to English, 
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subjects read the complete instructions with the administrator before taking 
the test, (Samoan) subjects were given the same instructions orally. A 
sample item was presented on the blackboard, and an opportunity to ask ques- 
tions was provided before the actual test was begun. 


C. RESULTS 
1. Intra-Samoan Comparisons 


Because the Girls were distributed over classification cells in a manner 
that precluded the usual orthogonal analysis of variance, the analyses for 
Boys and Girls are presented separately. For each sex, there is an overall 
three-way (Scale X Division X Language) analysis. This analysis is followed 
by subanalyses of significant effects, and these analyses are followed by sex 
comparisons. 

It should be noted that the analyses of variance were conducted with re- 
duced degrees of freedom. Since each person's score summed to 90, all sums 
of squares based on differences among individuals (i.e., Divisions, Language, 
Division X Language, and $ within Division X Language) vanished ; there- 
fore, all "between S" effects and their degrees of freedom were eliminated from 
the analyses. 

Table 1 contains the distribution of Boys’ means over Scale, Language, and 
Division. Table 2 contains the summary of the analysis of variance of these 
means, All of the effects were significant. 


TABLE 1 
MEAN SCALE Scores For Boys over Division AND LANGUAGE 
Scales 

Boys N S € R I B L 
Academic 

(English) 16 12.25 19.25 7.88 16.25 19.69 14.69 

(Samoan) 13 13.38 2246 9.62 8.82 25.23 10,69 
Commercial 

(English) 11 13.73 16.73 10.73 12.36 19.36 17.09 

(Samoan) 8 14.75 20.00 11.13 6.63 22.00 15.50 
Technical 

English) 20 13.25 17.25 11.74 11.90 18.85 17.00 

(Samoan) 17 13.65 19.35 11.71 10.76 19.41 15.12 


a. Scale X Division X Language effect. The meaning of this second-order 
interaction may be discerned from Table 3. Table 3 contains Scale X Division 
Language differences and tests of the Division-differences of those Language 
differences, The mean square for error found in Table 2 was used in all 
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TABLE 2 
ANALYSIS OF VARIANCE OF ScALE-SCORE MEANS FoR Boys 
(N = 85) 
Source 8. 8. df MS F 
Scales 6880.80 5 1376.16 92.86* 
Scales X Divisions 764.51 10 76.45 5.16" 
Scales X Languages 863.35 5 172.67 11.65* 
Scales X Divisions X 
Language 335.53 10 33.55 2.26* 
Scales X Subjects 
within Divisions X 
Languages (Error) 5854.81 395 14.82 


Bs 05. 


tests of significance for Boys. The tests of significance (18) reveal that Tech- 
nical Boys differed from Academic Boys and Commercial Boys with respect to 
Language differences on Independence, and that they differed from Academic 
Boys on Benevolence. Academic Boys did not differ from Commercial Boys 
with respect to Language differences on any of the scales. Another way of 
looking at the Scale X Division X Language interaction is to compare the 
Language differences over Scales for each of the Divisions (see upper half of 
Table 3). Academic Boys (English) scored higher than Academic Boys 
(Samoan) on Independence and Leadership, and they scored lower on Con- 
formity and Benevolence. Commercial Boys (English) scored higher than 
Commercial Boys (Samoan) on Independence. Technical Boys showed no 
significant Language differences. It can be seen that Academic Boys were 
affected most by the Language differential, and that Technical Boys were 
affected least. 


TABLE 3 
Boys’ (ENGLISH)-(SAMOAN) DIFFERENCES OVER SCALE AND DIVISION, 
AND COMPARISON BETWEEN DIVISIONS 


Scales 
Variable N S C ROT B zi 

Division 

Academic 29 —113 —3.21% —174 7.69* —5.54* — 4.00* 
Commercial 19 —1.02 —3.27 — 40 5.73" —2.64 1.59 
Technical ag. — 40 —2.10 04 114 —.56 1% 
Comparison 

Academic-Commercial E 06 —134 190 —290 . 241 
Academic-Technical | 1.78 —141 —148 649* —498* 242 


Commercial-Technical 


* p < .05, two-tailed t-test. 
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b. Scale X Division effect. Table 4 contains the information needed to 
understand the Scale X Division interaction. Academic Boys scored higher 
than Commercial Boys on Conformity and Independence, and they scored 
lower on Leadership, Academic Boys scored higher than Technical Boys on 
Conformity and Benevolence, and they scored lower on Recognition and Lead- 
ership. Commercial Boys were not different from Technical Boys on any of 
the scales. 


TABLE 4 
Mean ScaLe Scores ror Boys over Division 
Scales 
Variable N s © R I B L 
Division 
Academic 29 12.76 20.69 8.66 12.83 22.17 12.90 
Commercial 19 14.16 18.11 10.89 9.95 20.47 16.42 
Technical 37 13.43 18.22 11.73 1138 19.11 16.14 
Comparison 
Academic- 
Commercial — 140 2.588* — 223 2.88* 1.70 — 3.52* 
Academic- 
"Technical — 67 247* — 3.07° 145 3.06 — 3.24* 
Commercial- 
"Technical 43 — uU — .8& — 143 1.36 2% 


* p < .05, two-tailed t-test. 


€. Scale X Language effect. The data for the Scale X Language inter- 
action are presented in Table 5. Boys (English) scored significantly higher 
than Boys (Samoan) on Independence and Leadership, and they scored lower 
on Conformity and Benevolence. 


TABLE 5 
MEAN SCALE Scores For Boys over LANGUAGE 
Scales 
Variable N S С R I B L 
(English) 47 13.02 17.81 10.19 13.49 19.26 16.23 
(Samoan) 38 13.79 20.55 10.87 9.16 21.95 13.68 
Comparison — 77 — 2.74% — 68 4.33% — 2.69* 2.55* 


* p < .05, two-tailed t-test. 


d. Scale. 'The Newman-Keuls test (20) was applied to the scales which 
were ranked and grouped from low to high for all groups of subjects: total, 
two Language groups, three Division groups, and six Division X Language 
groups. In all rankings, Recognition is in the lowest set (i.e., least valued) and 
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ynformity and Benevolence are in the highest set. Independence is in the 
lowest set of scales for all groups of subjects except Academic (English). 
‘able 6 contains the distribution of Girls’ means over Scale, Language, 
and Division. Table 7 contains the summary of the unweighted-means analysis 
of variance (20) of these means. Only Scale and Scale X Language effects 
ire significant. 


TABLE 6 
MEAN SCALE Scores For GIRLS OVER Division AND LANGUAGE 
р Scales 
Division N 8 © R I B L 
Academic 
(English) 7 13.43 20.00 10.57 12.71 20.29 13.00 
(Samoan) 10 14.70 20.20 10.80 10.80 21.20 12.30 
mmercial 
(English) 18 12.83 19.17 9.17 14.11 19.22 15.50 
(Samoan) 11 12.27 22.00 9.09 10.55 22.18 13.91 
TABLE 7 


ANALYSIS OF VARIANCE (N Pg din MEANS For GIRLS 
= 46 


Adjusted 
Source $. S.a df MS F 
4127.91 5 825.58 68.06* 
Scales x Divisions 97.57 5 19.51 161 
Seales X Languages 152.57 5 30.51 2.52* 
Seales X Divisions X 
- Languages 47.30 5 946 78 
Seales X Subjects 
Within Divisions x Li 
Languages (Error) 2548.14 210 12.13 


_* Each SS except Error multiplied by #, = 10.26. 
>< 05. 


e. Scale X Language effect. Table 8 contains the information needed to 
Understand the Scale Language interaction. Girls (English) scored sig- 
nificantly higher than Girls (Samoan) on Independence, and they scored lower 
on Benevolence. (The mean square for error found in Table 8 was used to 
evaluate all comparisons. ) 

_ f. Scale. Using the Newman-Keuls tests for all possible groups of Girls, 
R ecognition invariably is in the lowest set, and Conformity and Benevolence 
are in the highest set. 
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TABLE 8 
Mean SCALE Scores ror GIRLS over LANGUAGE 
Scales 
Variable N S c R I B L 
(English) 25 13,00 19.40 9.56 13.72 19.52 14.80 
(Samoan) 21 13,43 21.14 9.90 10.67 21.71 13.14 
Сотрагіѕоп — 43 — 1.74 — .34 3.05. — 2.19* 1.66 


* p < 05, two-tailed t-test. 


2. Sex Comparisons 


А comparison of Table 2 (Boys’ Analysis of Variance) and Table 7 (Girls 
Analysis of Variance) shows that the analysis of data for Girls yielded fewer — 
significant effects than did the analysis for Boys. One factor involved in this 
may be that the unweighted-means method is not so precise a test as the other 
analysis. Because the analyses could not be carried out simultaneously, many 
of the sex comparisons are of necessity only inferential. 

a. Scale X Division X Language effect. Academic Boys were most A 
by language, showing significant differences on four scales; Technical Boys 
and Academic Girls were least affected, showing no Language differential. 
Commercial Girls showed significant Language differences on three scales: 
Commercial Girls (English) (mean — 14.11) scored higher than did Com- 
mercial Girls (Samoan) (mean = 10.55) on Independence (7 = 2.14, 
df — 27); Commercial Girls (English) (mean — 19.17) scored lower than 
Commercial Girls (Samoan) (mean = 22.0) on Conformity (# = 2.69, 
df = 27); and Commercial Girls (English) (mean = 19.22) scored lower 
than Commercial Girls (Samoan) (mean = 22.18) on Benevolence (/ = 2.24, 
df = 27). Commercial Boys (English) scored higher (mean — 12.36) than 
Commercial Boys (Samoan) (mean = 6.63) on only one scale, Independence 
(t = 3.20, df = 17). 

In the Academic-Commercial comparisons of Divisions-differences of Lan- 
guage differences, neither Boys nor Girls showed significant differences. 

b. Scale X Division effect. In testing the interactions of Scale x; Division 
for sex comparisons, significant differences were found only for the following: 
Academic Boys (mean = 8.66) scored lower than Academic Girls (mean = 
10.71) on Recognition (¢ = 2.19, df = 44), and Commercial Boys (mean 
= 18.11) scored lower than Commercial Girls (mean = 20.24) on Confor- 
mity (2 = 2.15, df = 96). It seems that Divisions have a greater effect оп 
Boys than on Girls, since there were significant differences between Academic 
Boys and Commercial Boys on three of the scales (see Table 4), while 7-tests 
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reveal significance for only one scale in the Scales X Division comparisons 
for Girls: Academic Girls (mean = 12.59) scored lower than did Commercial 
Girls (mean — 14.9) on Leadership (¢ = 2.1, df = 46). 

с. Scale X Language effect. Using t-tests to compare Boys (English) with 

3 Girls (English), and Boys (Samoan) with Girls (Samoan), no significant 

nces were found, A comparison of Table 5 (Boys’ Language effects) 

and Table 8 (Girls’ Language effects) seems to imply that Languages affect 

| 4 Boys more than Girls, since there are significant differences on four of the 
* scales for Boys and significant differences on only one scale for Girls. 

x d. Scale. One of the interesting findings of this study is the fact that 

mo significant differences exist between the sexes on any of the scales. 

‘able 9 contains scale means, standard deviations, and comparisons 

for these means. A Sex X Scale analysis of variance was conducted to test 

1 for sex differences. There was a significant Scale effect (F = 118.47, df = 

—— 5/645), but there was no significant Scales X Sex interaction (F = 1.18, 

, df = 5/645). Tests for sex difference were conducted on each scale. None 

was significant. The Newman-Keuls comparisons for Boys and Girls show 

similar patterns; Recognition is in the lowest set, and Conformity and Benev- 

{ Olence аге in the highest set. 


TABLE 9 


MEAN SCALE SCORES AND STANDARD DEVIATIONS OVER SCALE AND SEX 
Scales ^ 
Sex s c R I B L 
Boys (N = 85) 
x 136 1903 1049 1155 2045 15.09 
5р 298 374 331 41 ЖӨ; 16 
Girls (N = 46) 
E 1120 2020 972 1233 2052 7444 
$D $346 зв 378 432. 1476 
Comparison 16 147 277 — 48 — 49 1% 
Compari " 


yf 3. Cross-Cultural Comparisons 


М Gordon (6) published the results of administering the SIV to samples of 

American high school and of Japanese college students. "These data and those 

- obtained from the Samoan sample appear in Tables 10, 11, and 12. 

— Boys. Table 10 contains Scale means, standard deviations, comparisons, 

_ and tests of significance for Samoan, American, and Japanese Boys. Samoan 
Joys scored higher than American Boys on Conformity and Benevolence and 

(ey scored lower on Support, Recognition, and Independence. Samoan Boys 


wh 
xs 
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TABLE 10 
Mean SCALE Scores, STANDARD DEVIATIONS, AND CULTURAL COMPARISONS 
ток SAMOAN, AMERICAN, AND JAPANESE Boys 


Scales 
Variable 5 с R 1 в L 
- Country 
Samoan (N = 85) 
x 13.36 19.03 1049 11.55 20.45 15.09 
5р 2.82 3.74 3.31 4.31 4.94 4.8% 


American (N 


782) 
x 15.40 14.80 12.60 18.30 14,70 14.20 
SD 5.5 6.4 49 7.3 6.3 6.6 


Japanese (N = 188) 


5 4 15.60 18.90 7.90 19.10 16.70 11.80 
SD 4.6 49 3.7 45 5.2 5.0 
Comparison 
Samoan-American — 2.04* 4.239% — 2.11% — 6.75* 575* 89 
Samoan-Japanese — 224* A3 211* — 7.55* 3.75* 3.29* 
American-Japanese — 2 —+1* 47* — 3 —20* 24 ® 


* p < .05, two-tailed t-test. 


scored higher than Japanese Boys on. Recognition, Benevolence, and Leader- 
ship and they scored lower on Support and Independence. American Boys 
scored higher than Japanese Boys on Recognition and Leadership, and they 


TABLE 11 
MEAN SCALE SCORES, STANDARD DEVIATIONS, AND CULTURAL COMPARISONS 
FOR SAMOAN, AMERICAN, AND JAPANESE GIRLS 


Scales 
Variable s С R I B L 
Country 
Samoan (N = 46) 
x 13.20 20.20 9.72 12.33 20.52 14.04 
SD 3.85 3.6. 3.78 4.32 474 441 
American (N = 666) 
x 17.5 16.0 12.7 14.3 18.9 10.3 
SD 4.8 6.1 48 6.6 5.9 6.0 
Japanese (N — 285) 
X 17.0 184 79 19.0 17.2 10.4 
SD 4.6 3.9 3.5 5.0 48 4.0 
Comparison 
Samoan-American — 4.30* 4.20 — 2.98% — 1.97* 1.62% 3.74% 
Samoan-Japanese — 330  18* 182 — 6.67% 3.32% 3.648 
American-Japanese 5 —24* 48* —47* 17* — 4 


* p < .05, two-tailed t-test. 
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1 TABLE 12 


cor 
Mean Dirrerences (Boys-Giats) or Scones ovez 
SAMOAN, AMERICAN, AND uen л. " 


Scales 
Variable N S С R I B L 
Samoan з i aae A ee as 
American 1448 —210* -—120* —.20 4.0009  —420* 3.90* 
Japanese 4733 —140* 5 — i —4 140° _ 
прагіѕзоп T 
Samoan- ; 
American 226" °з 97 ame мә 
$атоап- 
' Japanese 156 —1.67 77 — 38 43 
 Атегісап- 
Japanese — 70 —1.70° —20 3.90 —3.70* 


scored lower on Conformity and Benevolence. Of the three groups compared, 
Samoan Boys were lowest on Support and Independence, and they were highest 
on Benevolence. 

b. Girls. Table 11 contains Scale means, standard deviations, comparisons, 
and tests of significance for Samoan, American, and Japanese Girls. Samoan 
Girls scored higher than American Girls on Conformity, Benevolence, and 
tadership, and they scored lower on Support, Recognition, and Independence. 
oan Girls scored higher than Japanese Girls on Conformity, Recognition, 
evolence, and Leadership, and they scored lower on Support and Indepen- 
. American Girls scored higher than Japanese Girls on Recognition and 
volence, and they scored lower on Conformity and Independence. Of the 
*e groups compared, Samoan Girls were lowest on Support and Indepen- 
ce, and they were highest on Conformity, Benevolence, and Leadership. 
Sex comparisons. Table 12 contains sex differences (Boys-Girls) and 
of significance of comparisons of sex-differences across cultures for Samoan, 
rican, and Japanese samples. The upper half of this table shows that 
were no significant sex differences for Samoan. For the Americans, there 
significant sex differences on five of the scales: Independence and Leader- 
; and they scored lower on Support, Conformity, and Benevolence. For 
Japanese, there were significant sex differences on two of the scales: Japa- 
Boys scored higher than Japanese Girls on Leadership ; and they scored 
T on Support. 

© lower half of Table 12 shows that Samoan sex differences were sig- 
antly different from American sex differences on four of the scales: (the 
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directions of the differences should be noted) Support, Independence, Benev- 
olence, and Leadership. Samoan sex differences did not differ significantly 
from Japanese sex differences on any of the scales. American sex differences 
were significantly different from Japanese sex differences on four of the scales: 
Conformity, Independence, Benevolence, and Leadership. 


D. Discussion 


The previously reported embeddedness of Samoans in the matai system is 
reflected in the high values attached to Conformity and Benevolence, and in 
the low values attached to Recognition and Independence for all Samoan 
groups except for Academic (English) Boys. The small sex role differences 
which characterize Oceanic cultures (1) are reflected in the absence of a sex 
difference on any of the scales. 

Other aspects of Samoan culture are brought into focus when Samoan scores 
are compared with those from the United States and from Japan (Tables 10 
to 12). Samoan Boys and Girls are lower than their counterparts in the United 
States and in Japan on Support and Independence, they are higher than both 
on Benevolence, and they are lower than American subjects but higher than 
Japanese subjects on Recognition. It seems likely that the relatively low posi- 
tion of Support is attributable to the individual nature of the Support items; 
i.e., Support has the individual as the referent. While Benevolence also appears, 
on the surface, to refer to individuals, it is much more easily embedded in the 
group-oriented activities of Samoa than is Support. Being generous and 
sharing with others—welcoming guests, bringing ceremonial gifts to enhance 
the family name (10), using the “Language of Courtesy” (7)—is part of 
the Samoan pattern of behavior; however, to be emotionally dependent upon 
others in order to function adequately (to be encouraged, to be understood) 
is not part of the pattern. 3 

In the absence of a sex difference within the Samoan and within the Japa- 
nese samples, and with no difference between the differences, the data on Con- 
formity are difficult to interpret; i.e., there is no clear reason why Samoan 
Girls should be more conforming than Japanese Girls, but that Samoan Boys 
should not be more conforming than Japanese Boys. If the results are not the 
consequence of a statistical “accident,” then it may be that they reflect post- 
World War II changes in Japanese women. The behavior change for Japa- 
nese Girls (particularly of college age), and the rejection of older customs, 
may be more drastic than for Japanese males. 

The results with respect to Leadership seem much clearer: American males 
and Samoans of both sexes value Leadership; Japanese of both sexes do not. 
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The relatively high score of Samoan women is consistent with observations 
that there is a “shadow structure" (10) for women which parallels the 
matai system for men. Samoan women feel themselves capable of exercising 
leadership and they vie for leadership status. 

Relative to Japanese and American samples, therefore, Samoans seem to 
value an acceptance of a set of group-oriented mores which emphasizes (among 
other things) benevolence toward others. Despite the apparent closedness of 
the system, Samoans value the experience of exercising leadership and they 
accept and value the social mobility that permits one to move toward a posi- 
tion of leadership. They tend to reject, in the main, values which have an 
individualized cast—i.e., which in some way differentiate the individual from 
the social context—Recognition, Independence, and Support. 

In order to understand possible changes taking place in Samoa, it is necessary 
to consider the differences among students who follow different scholastic 
programs. Placement into a particular Division depends upon English com- 
prehension, previous scholastic record, understanding of an adaptability to 
Western modes of thought, alertness, aptitudes, and the kindred group’s deci- 
sion about the individual’s future. Boys who go into the Academic Division 
can look toward further education in the United States, high positions in 
Samoa with industry or the government, and leadership roles in the kindred 
gtoup. It is possible for them to become chiefs or talking chiefs. Commercial 
Boys are likely to hold good jobs in Samoa, and some (but not many) may 
assume leadership roles in the Samoan social system. Technical Boys generally 
will become day laborers for the government in carpentry or construction, or 
they may provide services to the kindred group at a relatively low occupational 
level. 

Academic Girls frequently continue their education in the United States and 
look forward to the possibility of good government positions in Samoa or in 
the United States. They often marry leaders in Samoan or in American com- 
munities, Commercial girls look toward high-status positions and marriages in 
Samoa. 

If one assumes that reaction to the test language is indicative of sensitivity 
to the corresponding culture and that Division differences indicate something 
akin to culture strain—to diversification within Samoan culture that may 
be attributable to external influence—then signs of change within the culture 
appear, 

: Within the range of girls who served as subjects, the lack of Division effects 
indicates that girls remain a relatively homogeneous group despite the different 
Vocational, marital, and status possibilities that are associated with enrollment 
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in different Divisions. There is evidence for change for the group as a whole 
in the finding that (English) resulted in a movement away from the cultural 
norm with respect to Independence (Table 6). 

For boys, the pattern of evidence is considerably different. Technical Boys 
show little sign of change, apparently being strongly embedded in the Samoan 
culture. There are no language effects for Technical Boys, and this group may 
be used as a standard for evaluating boys from the other Divisions, Commercial 
Boys evince a language difference on Independence, and this difference is 
reliably different from the difference for Technical Boys (Table 3). Com- 
mercial Boys (Samoan) have a lower mean than do Technical Boys (Samoan). 
It is as though Commercial Boys find it important to deemphasize independence 
in Samoan contexts (in order to guard themselves against rising feelings of 
independence?). 

Academic Boys, as might be expected, show the strongest signs of change. 
There are higher scores for (English) on Independence and Leadership, 
and higher scores in (Samoan) on Conformity and Benevolence. Academic 
Boys act as though they emphasize the values of the particular social context 
in which they are performing. This way of behaving makes good sense when 
one considers that they have aspirations which are viable in both Samoan and 
American cultures; indeed, as adults, these youngsters may be called upon to 
assume leadership roles in both cultures. That Academic Boys are being 
pulled away from their peers is indicated by the data contained in Table 13 in 
which significant inter-Division comparisons within Scale X Language are 
presented.* It can be seen that Academic Boys have higher (English) means 
than do boys from either of the other divisions on Independence. They have 
а lower (English) Recognition mean than do Technical Boys. For (Samoan), 
they have higher Conformity and Benevolence means than do "Technical Boys 
and they have a lower Leadership mean than do Commercial or Technical 
Boys. 

The picture that emerges is that of a society in which the people who are 
10 to 20 years away from determining its direction have begun to abandon 
older values and to accept those of another culture. It would be of great value 
to trace the course of this change by examining various strata in the society 
over a period of years. It also would be of value to determine how intensive 
these changes are by examining “deeper” values, perhaps those proposed by 


4 The tests reported in Table 13 are conservative in the sense that they do not 
use all of the df available in the study-as-a-whole. Using the error estimate from 
Table 2 (an estimate that is too small because it takes advantage of the correlations 
between scales), Academic (Samoan) Boys are also higher than Commercial (Samoan) 
Boys on Conformity and Benevolence. 
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TABLE 13 
SIGNIFICANT (№ < .05) INTERDIVISION COMPARISONS WITHIN SCALE X LANGUAGE 
(Boys) 
Language Scale Division N Mean SD t 
(English) Independence ` Academic 16 1625 378 
2.68 
Commercial 11 1236 3.64 
Academic 16 1625 3.78 
3.51 
Technical 20 1190 3.52 
Recognition Academic 16 788 270 
3.69 
"Technical 20 1175 335 
(Samoan) Independence Commercial 8 6.63 2.50 
3.84 
Technical 17 1076 2.53 
Conformity Academic 13 2246 333 
321 
"Technical 17 1935 193 
Benevolence Academic 13 25.53 3.08 s 
Technical 17194 5.07. 
Leadership Academic 13 10.69 3.27 ME 
Commercial $ 1550 6.54 
Academic 13 10.69 3.27 
2.82 
Technical 17 15.12 4.82 


Florence Kluckhohn (11). Understanding these changes might make possible 
а smoother transition from Oceanic culture and cultures in other parts of the 
world (Africa, for example) to Western industrialized society—a transition 
which does other than shatter a society and create a set of alienated people and 


anomalous customs. 
E. SUMMARY 


In an attempt to study some of the values held by Samoans, the Gordon 
Survey of Interpersonal Values was administered to 131 high school students 
in American Samoa. The students were partitioned by sex, academic Division 
(Academic, Commercial, Technical) and by test language (English, Samoan). 
For the overall sample, Conformity and Benevolence are valued most and 
Recognition is valued least. There were no sex differences on individual scales, 
but the Language X Division interaction for boys was different from that 
for girls, In comparison with Japanese and American samples, Samoan boys 
were lowest on Support and Independence and highest on Benevolence. Samoan 
girls were lowest on Support and Independence, and they were highest on 
Conformity, Benevolence, and Leadership. 
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The consistency between many of the findings and conclusions based on 


anthropological observations was noted, and evidence was presented that the 
Samoan value system is beginning to change in the presence of contact with 
industrial societies. 


20. 
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AFFILIATIVE BEHAVIOR AS A FUNCTION OF APPROACH 
AND AVOIDANCE AFFILIATION MOTIVES, OPINION 
EVALUATION, AND BIRTH ORDER*? 


Psychology Department, San Jose State College 


MARSHALL L. HAMILTON 


A. [INTRODUCTION 


In his investigation of the affiliation process in individuals, Schachter (10) 
has followed the lead of Festinger (3) in considering “the drive for self- 
evaluation” an important factor in the need for affiliation. The drive is assumed 
to apply to the evaluation of one’s emotions, abilities, and opinions. Schachter 
refers variously to “pressures to establish the ‘correctness’ of an opinion,” 
“ambiguous situations,” and “unclarity or uncertainty about an opinion” as 
conditions that should lead to the arousal of affiliative tendencies aimed at 
permitting social evaluation of opinions. It seems likely that these affiliative 
tendencies are superimposed upon the need for affiliation which is charac- 
teristic of an individual. Byrne, McDonald, and Mikawa (2) found evidence 
for both approach and avoidance motives within need affiliation, which may 
reflect “a generalized expectancy concerning people as sources of reward or 
punishment.” 

The following hypotheses are suggested by the preceding discussion : (a) 
subjects expecting to have to form an opinion on issues described as ambiguous, 
complex, and difficult demonstrate stronger affiliative tendencies than do sub- 
jects to whom the issues are described as clear-cut, uncomplicated, and easy; 
(b) the affiliative tendency is strongest in subjects with approach motivation, 
less strong in subjects with conflicting motivation and in unmotivated subjects, 
and least strong in subjects with avoidance motivation; and (с) oldest or 
only children have stronger affiliative tendencies than do later-born children 
under the experimental condition intended to arouse a high drive for self- 
evaluation, 


— 
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B. METHOD 
1. Experimental Design 


A2 X 2 X 2 X 4 factorial design was utilized with two conditions of 
opinion formation, two levels of birth order, sex, and four levels of affiliation 
motivation. A total of 192 subjects were obtained from introductory psychology 
classes, in order to have six subjects for each cell of the design. "They were 
assembled in 19 groups which varied in size from five to 34 subjects. All of the 
experimental procedure was conducted at one time for each group. 


2. Approach and Avoidance Affiliation Motives 


The first step with each group was the administration of an extended 
version of the test of Interpersonal Affect utilized by Byrne, McDonald, and 
Mikawa. The test was introduced as a new personality test undergoing stan- 
dardization. It consists of ten 834” X 11” pictures in a manila envelope. In 
the order in which the pictures are viewed, they are 102, 28, 83, 103, girl 
and woman cooking, 91, two females sitting, 24, 87, and 53 [numbered 
pictures are from Atkinson (1, pp. 832-834) ; others are from Veroff, Atkin- 
son, Feld, and Gurin (12, pp. 3-4)]. The timing and instructions given by 
Atkinson (1, pp. 836-837) were utilized, with timing controlled by having 
the subjects remove a picture from the envelope, view it for 20 seconds, and 
replace it beneath the other pictures in the envelope. Scoring was by the system 
set forth by Byrne ef al. (2). 

The original four-picture Interpersonal Affect Test (IAT) was lengthened 
by six pictures in an effort to improve its reliability. Following the procedures 
indicated above, the test was administered to 29 introductory psychology stu- 
dents and subjected to an item analysis. On the basis of the correlation between 
positive, negative, ambivalent, or lack of affect scores on a particular picture 
and the total of cach of these scores, the seven best pictures for each of the 
scores were selected.? This resulted in a mean total of 8.4 stories contributing 
to the subject’s scores on the four types of affect. The reliability figures for the 
four types of affect in the extended test ranged from .69 to .92 for interscorer, 
.41 to .67 for split-half, and .47 to .77 for three-week test-retest. 

These figures represent only a minimal gain in reliability compared to 
those reported by Byrne et al. for the four-item test. However, it seems reason- 


2 The numbers (referring to order of presentation) of the pictures which were 
pored. es Be four vim of Jeterpessonel Affect were as бөк (a) ена affect— 
i $ negative affect—1, 3, 4, 5, 7, 8, 10; i ET 

1,4, 5, 6, 7, 8, 9; (d) lack of afect—1, 3,4, 5, 7, 9, 10. ' cron MEA 
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to assume that the extended test is a more adequate behavior sample be- 
of its increased length and variety of interpersonal situations pictured. 
problem of test-retest reliability was compensated for by conducting all 
‘of the procedures within the same hour. In considering the figures for split- 
alf reliability, it is apparent that they represent an improvement over similar 
figures for п affiliation, such as the .16 figure reported by Byrne et al. 
р. 27). 

In order to test the effects of order of the pictures on test scores, 15 of the 
ects took the test with the 10 pictures in the order listed above, while 15 
bjects used the pictures in the order 6 to 10, 1 to 5. Reitman and Atkinson 
1, p. 671) have observed some order effect in n affiliation protocols. How- 
er no order trends were observed in the present test, and the scores for each 
in the two orders correlated .87, .75, .69, and .87 for positive, negative, 
nbivalent, and lack of affect, respectively. In view of the imperfect reliability 
the scoring method, these figures suggest little difference between the two 
TS. 

о assign subjects to the four approach-avoidance motivation patterns, а 
rent set of positive-negative ratios from those used by Byrne et al. was 
, since the total possible score was increased in the extended test. A subject 
assigned to the approach motivation group if his total number of positive 
fect stories was greater, by two or more, than his total number of negative 
ories, If the number of negative stories was greater, by two or more, than 
number of positive stories, he was assigned to the avoidance group. When 
th ratio of positive to negative stories was equal, or differed only by one, the 
ect was assigned to the mixed motivation group, except that ratios of 
0, 0-1, or 0-0 indicated assignment to the unmotivated group. 


3. Experimental Conditions 


_ Following the administration of the LAT, the subject received at random 
Ine of two written descriptions of an experiment, in which he was led to 
eve he was to participate within 24 hours. He was told that the experiment 
as independent of the test which he had just taken. Both forms introduced a 
tudy of the "psychological aspects of the opinion formation process,” in 
ch subjects would be given a list of issues to study, and would later be 
oned regarding their evaluation of the issues. During the questioning 
experimenters would make observations about the opinion formation 
ess. 

The experimental group (subjects receiving Form I) read of ambiguous, 
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complex issues involving many variables. They were told that they would 
probably have had little contact with the issues, would be unclear and un- 
certain about their opinion, and that there were many factors for and against 
the various stands which could be taken. The control group (Form II) read 
of unambiguous, simple issues involving few variables. "They were told that 
they would probably have had considerable contact with the issues, would be 
clear and certain about their opinion, and that the favorable factors would 
be clearly centered around one of the stands which could be taken. The variety 
of adjectives is aimed at creating two levels of the conditions which Schachter 
and others describe as leading to the arousal of affiliative tendencies aimed at 
social evaluation of one's opinion. 

To determine the effectiveness of the descriptions, both forms end with a 
six-item semantic differential procedure [instructions from Nunnally (7)] 
in which the subject is asked to indicate what he expects the task of evaluating 
the issues to be like. The items are clear-hazy, unusual-usual, simple-complex, 
difficult-easy, precise-vague, and ambiguous-unambiguous, 

Following the completion of the form, the groups were told that, contrary 
to the written description, there would be a delay in the beginning of the study 
and that the experimenter would have to contact them later rather than begin 
taking subjects immediately. It was thought to be necessary to use all of the 
groups within a few days in order to prevent any suspicion about the bogus 
nature of the experiment from spreading. 


4. Affiliative Tendency 


In the description of the opinion formation study, the subject was asked to 
indicate whether he would prefer to be alone while he was considering the 
list of issues, or to be with other students who would be talking and thinking 
about the same list of issues. He was then asked to indicate the strength of 
his preference on a six-point scale, the explanation being that it might not be 
possible to make suitable arrangements for everyone. This method of mea- 
suring the dependent variable is quite similar to that used in most of the 
studies cited below. It is identical with that used in Schachter’s later studies, 
except that the neutral category is omitted from the initial choice of waiting 
conditions, as in the procedure followed by Gerard and Rabbie (4). 


5. Ordinal Position 


Questions allowing the determination of ordinal position were included at 
the beginning of the description of the opinion formation study, along with 
the blanks for other identifying data. Subjects with twins were not used. 


MARSHALL L. HAMILTON 65 


6. Sex 
It seemed advisable to analyze the sex variable, since previous investigators 
have sometimes used only male subjects [see Sarnoff and Zimbardo (9) and 
Zimbardo and Formica (14)], only female subjects [see Schachter (10), 
Radloff (8), and Singer and Shockley (11)], or both in segregated groups 
[see Wrightsman (13) and Gerard and Rabbie (+) ], with some variation in 
results. 


C. RrsuLTS 
1. Effectiveness of the Experimental Conditions 


The mean preference for being with others is shown by level of each factor 
in Table 1. The analysis of variance indicated that the experimental manipu- 
lations did not influence the affiliative tendency. In fact the mean preference 
- for being with others is slightly lower in the experimental group than it is in 
_ the control group. That the two forms were effective in creating the intended 
experimental conditions is apparent from inspection of the mean ratings of 
the semantic differential items by the two groups as shown in Table 2, and 
the analysis of variance which yielded an F of 54.33 (p < .001). It is apparent 
that Form I was effective at a reliable level in leading a subject to expect a 
future opinion formation task to be more hazy, unusual, complex, difficult, 
vague, and ambiguous than is true for a subject receiving Form II. Essentially 
identical results were achieved earlier when the two forms were administered 
in a pilot study. "Thus, the conditions necessary for the operation of the drive 
for self-evaluation seem to have been created, but the affiliative tendency was 
not influenced. 


TABLE 1 
MEAN Responses BY LEVELS OF THE EXPERIMENTAL CONDITION, ORDINAL POSITION, 
Sex, AND MorivATION PATTERN FACTORS ON THE Six-PoiNT SCALE OF 
PREFERENCE FOR BEING WITH OTHERS 


Factor and level N Mean SD 
Experimental condition 
Experimental (Form I) 96 3.91 1,53 
Control (Form II) 96 4.00 1.13 
Affliation motivation 
Approach 48 4.37 83 
Avoidance 48 3.97 1.51 
Mixed 48 3.85 1.33 
Unmotivated 48 3.62 1.43 
Ordinal positi 
Early-born И; 96 3.91 1.28 
Later-born 96 4.00 1.38 
Sex 
. Males 96 3.92 1.23 
Females 96 3.99 144 


Sele rr EI E liccc 
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TABLE 2 
MEAN SEMANTIC DIFFERENTIAL RATINGS® 
Item Form I SD Form II SD 
Hazy-clear 3.91 1.73 3.54 1.71 
Unusual-usual 4.00 1.64 3.24 1.95 
Complex-simple 4.92 1.75 4.15 1.73 
Difficult-easy 3.35 1.57 2.68 1.45 
Vague-precise 441 1.62 3.71 1.59 
Ambiguous-unambiguous 3.36 1.64 2.21 1.51 


a High means indicate the more hazy, unusual, etc. ratings. 


2. Effectiveness of the Subject Variables 


‘The approach-avoidance affiliation motivation pattern of the subjects is the 
only main effect which is significantly related to the affiliative tendency 
(F = 3.64, p < .02). However, the order of the groups is not as predicted. 
The highest mean preference for being with others did occur in the approach 
motivated group, but the avoidance group ranked above the conflict and un- 
motivated groups rather than below them. 

Although the analysis of the semantic differential items indicates that the 
conditions necessary for the operation of the ordinal position effect were 
created, neither the main effect nor the interaction of ordinal position with 
experimental conditions approached significance. 

No predictions regarding the influence of sex differences on affiliative behav- 
ior were offered, and little influence was indicated in the analysis. 

The only interaction to approach or reach significance was the (unantici- 
pated) four-factor interaction. It seems to reflect the preference of early-born 
unmotivated females for being with others in the experimental condition and 
alone in the control condition, while later-born, unmotivated females prefer to 
be alone in the experimental condition and with others in the control condition. 

One contradiction between the assumptions involved in the statistical anal- 
ysis and the experimental procedure used is that the subjects are assumed to 
be independent even though they were used in a series of groups. Several 
factors support the assumption of independence. The first of these involves 
the subjects used. Acquaintances among the subjects were probably minimal, 
since the groups were assembled during the second week of the semester; 
the subjects were drawn from four large classes, and in most of the groups 
more than one class was represented. During the experiment the subjects were 
seated at least one seat apart in a large classroom. The two parts of the 


experimental procedure kept them well occupied in reading and writing during 
the 50-minute procedure. 
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If the responses were influenced systematically by group membership, it 
should be revealed by the application of a single classification analysis of 
variance to the groups. Application of this technique yields an F of .98, which 
does not suggest any influence on the dependent variable by group membership. 

An additional comparison was made between 10 subjects who were admin- 
istered Forms I or II individually (mean = 3.50) and the subjects who par- 
ticipated in the group procedure (mean = 3.95). The obtained ¢ (1.22, 
200 df) was not significant. 

Each of these considerations suggests justification for the assumption of the 
independence of the subjects. 


D. Discussion 


In the line of research on afüliative behavior initiated by Schachter (10), 
it has been the interaction of the high drive for social evaluation condition 
with early ordinal position which has resulted in the highest preference for 
being with others. While the high and low drive conditions typically affect 
affiliative behavior independent of ordinal position, ordinal position has not 
affected affiliative behavior independent of the high drive condition. In the 
present study both aspects of the interaction contain possible explanations for 
the lack of support for the social evaluation model. 

Probably the more likely explanation for the apparent lack of arousal of 
the self-evaluation drive lies in the experimental manipulations. While it is 
clear that the subjects in the two experimental groups were exposed to two 
levels of the conditions thought to be necessary for the drive for self-evaluation 
to operate, it may be that the two descriptions of the “impending” experiment 
were appreciated only at an abstract, verbal level. The present procedure was 
quite similar to that used in the studies (10) involving the creation of two 
levels of anxiety through descriptions of hypothetical experiments. However, 
in dealing with opinion rather than emotion, it may be more difficult to bring 
about the degree of involvement necessary for the self-evaluation drive to 
operate, Radloff (8) may have accomplished this with his procedure which 
required subjects actually to form an opinion on a specific issue. (A more direct 
test of the social evaluation model will come from arousing different levels of 
drive, as was attempted in the present study, rather than assuming that the 
drive has been aroused and then studying different levels of satiation, as in 
Radloff’s procedure.) At any rate it seems unlikely that improvement of the 
Present experimental manipulations would achieve the desired effect, since the 
differences between the two experimental conditions were not even in the right 
direction, 
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If a higher level of the drive for self-evaluation was not aroused in the | 
experimental group, the ordinal position effect would not be expected to 
occur, since it has not previously been demonstrated independent of the drive 
condition. 

Another explanation for the apparent failure of the drive arousal attempt 
comes from subject differences which might be relevant to affiliative behavior. 
The subjects in previous studies have been, variously, volunteers (9, 10), 
paid volunteers (4), unspecified (8), or have been offered extra credit in their 
course work (10, 13). Zimbardo and Formica (14) and Singer and Shockley 
(11) recruited subjects who were apparently required to participate in experi- 
ments, as is true of the subjects in the present study, and they did not find 
the overall ordinal position effects to be significant. In addition, the subjects 
in these studies have been college students from the Northeast, whereas the 
present subjects are University of Texas students. 

The possibility of regional differences seems especially relevant to the ordinal 
position variable. Glass, Horwitz, Firestone, and Grinker (5) are among 
those who point out that ordinal position per se is not a psychological variable, 
and it should not be expected to show similar effects among persons from 
different sociological backgrounds. After reviewing a large number of the 
earlier studies of birth order, Murphy, Murphy, and Newcomb (6) con- 
cluded that the objective fact of ordinal position was sure to yield meager 
results if its meaning in the specific case was ignored. It seems likely that 
further research along this line will profit more from efforts to deal with 
the personality variable underlying the ordinal position effect than from ex- 
pecting the gross division between early- and later-borns to yield consistent 
results. The present attempt to control for generalized expectancies concern- 
ing others as indicated in approach and avoidance affiliation motives seemed 
promising. 

Even though the attempt to improve the reliability of the ТАТ was in- 
effective, it proved to be the only variable in the present study which was 
clearly related to affiliative behavior. The responses of the four affiliation 
motivation groups were largely consistent with the tentative theoretical 
sketch offered by Byrne eż al. (2). As might be expected, the mean choice 
of subjects with approach affiliation motivation fell between “I prefer being 
together with others" and "I slightly prefer being together with others." Only 
seven of the 48 subjects in this group preferred to be alone, as contrasted with 
16, 17, and 21 subjects in the avoidance, mixed, and unmotivated groups. The 
mean choice of the unmotivated group conformed to expectation by falling 
near the midpoint between “I slightly prefer being alone” and “I slightly 
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prefer being together with others.” As the above authors had found, the mixed 
motivation group responded much like the unmotivated group. 

The only group that differed clearly from the predicted order was the 
avoidance group. Rather than showing the least preference for being with 
others, the mean response fell near “I slightly prefer being together with 
others." One possibility is that these subjects chose to be with others through 
motivation different from that of the approach motivated subjects. Since sub- 
jects are assigned to this group through producing a predominance of stories 
involving active or passive aggression, they may be characterized by aggressive 
drives which can only be satisfied in interaction with others. 

The Interpersonal Affect scoring system has proven itself to be a useful 
measure of affiliation motives, and a measure worthy of refinement. Perhaps 
the most fruitful approach would be to use the present test as an external 
criterion to aid in development of an “objective” measure of affiliation moti- 
vation, The measure should prove useful in further studies of affiliative be- 
havior, and would make it possible to compare subjects across experiments 
which differ in aspects relevant to affiliative behavior. 


E. SUMMARY 


Previous research led to the postulation of a drive for self-evaluation as an 
important factor in affiliative behavior. One hundred and ninety-two subjects 
were assigned to two experimental conditions aimed at creating different levels 
of the drive for self-evaluation as it applies to the evaluation of one’s opinions. 
Subject differences in affiliation motivation, ordinal position, and sex were 
analyzed, Although the experimental conditions were effectively created, no 
differences in affiliative behavior occurred between the two conditions. Of the 
subject variables, affiliation motivation pattern, as measured with an extended 
version of the Interpersonal Affect Test, was significantly related to affiliative 
behavior. The IAT should prove a useful measure in further studies of affilia- 
tion, particularly as a replacement for the superficial variable of ordinal 
Position, 
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CRITIQUE OF THE JANIS AND FESHBACH STUDY* 
Department of Psychology, Montana State University 


J. D. Dukx 


A. INTRODUCTION 


In an experiment using high school subjects, Janis and Feshbach (5) found 
iT-arousing communications less effective than a comparable appeal which 
the inclusion of threat references, Subjects exposed to their nonemo- 
inal appcal indicated by questionnaire responses that they more often accepted 
е communication recommendations than did the subjects exposed to the fear 
ippeals. The subjects exposed to the nonemotional appeal also evidenced greater 
fesistance to a counter propaganda appeal they were exposed to one week later. 
hese findings by Janis and Feshbach seriously question the traditional assump- 
that fear appeals increase the propagandist’s chances to induce his audience 
0 adhere to his recommendations. 
Janis and Feshbach cautioned against overgeneralization from their study, 
ut developments since 1953 reveal that their caution has not been heeded. 
т findings have been assimilated into general psychology [see, for examples, 
d (3); Krech and Crutchfield (7); Morgan (8); and Ruch (9)]. 
т study is widely quoted in discussions about factors affecting propaganda 
cy. If one judges by contemporary mass media public service messages, 
у federal agencies have accepted in toto the Janis and Feshbach interpre- 
tations. Overgeneralization appears to be the rule rather than the exception. 
This paper proposes to examine a major weakness in the Janis and Feshbach 
tudy, and to report an experiment bearing on the validity of interpretations 
Made by Janis and Feshbach. 


B. RESUME OF THE JANIS AND FESHBACH STUDY 


Serving as subjects were 200 freshmen of a Connecticut high school. There 
Were three experimental groups, and one control group. Each group was 
comprised of 50 subjects. The three experimental groups were exposed to 

Pals about dental hygiene, while the control group was exposed to a 
ommunication about the structure and functioning of the eye. The experi- 
mental appeals were designed to contain the same information about dental 
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hygiene, and the same recommendations, but the appeals were constructed to 
vary in degree of fear arousal, There were a “strong fear appeal” group, a 
“moderate fear appeal” group, and a “minimal fear appeal” group. Fear arousal 
was achieved by showing slides of horrible dental pathology, and by including 
threat references of the this-can-happen-to-you variety in the accompanying 
lecture. 

The following measures were obtained. Conformity to the communication 
recommendations was measured by a questionnaire taken by all groups one 
week before, immediately after, and one week after exposure to their respective 
appeals. Immediately after exposure to their appeals, a 23-item quiz tested 
learning of the factual content of the three experimental appeals. Just prior 
to the final questionnaire, all subjects heard a counterpropaganda appeal 
specifically disputing a major argument advanced in the experimental appeals. 
Changes in the final questionnaire responses thus assessed "resistance to coun- 
terpropaganda.” Incidental measures helped establish that the three experi- 
mental appeals varied as expected in degree of emotional arousal. 

Janis and Feshbach reported that the three groups did not differ significantly 
in degree of learning, but that all experimental groups performed better on 
the 23-item test than did the control group. Then they went on to make the 
conclusions cited in the introduction—namely, that the strong appeal was least 
effective, while the minimal appeal was most effective in inducing the subjects 
to report adoption of the dental practices urged by the experimental appeals. 
In addition, it was found that all three groups significantly resisted the counter- 
propaganda, but the minimal fear group resisted significantly more than did 
the moderate or strong fear groups. 

Janis and Feshbach interpreted their findings as consistent with a “defensive 
avoidance" hypothesis. Subjects exposed to fear appeals ignore or minimize 
the importance of the threat references in order to reduce or avoid anxiety. 
They fail to think about the appeal's recommendations, therefore are less influ- 
enced by the appeal. Janis and Feshbach briefly considered and rejected alter- 
native interpretations of their findings. 


C. CRITICISM or THE JANIS AND FESHBACH STUDY 


Perhaps the criticism advanced here will not justify rejection of the Janis 
and Feshbach conclusions, but it should caution again against premature and 
unqualified acceptance of their findings. 

The major criticism of the Janis and Feshbach study has to do with their 
cursory treatment of learning differences among the three experimental groups. 
Since differential conformity could be the result of differential learning, it 
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behooved Janis and Feshbach to give special attention to the problem of mea- 
suring differences in learning. They did report that “no significant differences” 
occurred, but there is a surprising lack of information about the 23-item test 
they used. Supporting data were included in neither the major reference cited 
nor in another account of the same experiment (4). In fact, the reader has to 
look at an account of another experiment (6) to learn that the 23-item test 
was a “recognition” test. The kind of recognition test employed is still un- 
reported. 

The null hypothesis is easily accepted if the*measuring instrument is un- 
reliable or insensitive. With such a test, substantial differences in learning 
may go undetected. This is not to assert that the Janis and Feshbach test was 
insensitive, but Janis and Feshbach should have given a full description of 
their test and let their empirical data speak for itself. In 1958 a request by 


the writer for this information went unanswered. 


The subsequent Janis and Milholland experiment suggests the Yale inves- 
tigators had doubts about the efficacy of their 23-item test. In this experiment 
one group of adults heard the strong appeal used in the Janis and Feshbach 
experiment, while an equated group heard the minimal appeal. Then, in the 
words of the authors, a “more sensitive” test of learning, a verbatim recall 
test, was administered to both groups. This choice of words is the first hint that 
the Yale investigators might not have had full confidence in their 23-item test. 

Content analysis of the results of the verbatim recall test reveal that im- 
portant qualitative differences in learning did occur. Both groups remembered 
approximately the same amount of information, but the strong fear appeal 
group better remembered the consequences of dental pathology, while the 
Eroup exposed to the minimal fear appeal better remembered the causes of 
dental pathology. Perhaps the group viewing the gory slides found the sub- 
sequent discussion of causes and recommendations anticlimactic and less 
attention-catching than the vivid slide material. Janis and Milholland did not 


- Pursue this line of reasoning. Instead, they reported increased confidence in 
the original Janis and Feshbach interpretations. 


Another bit of evidence in support of the hypothesis that differential learn- 


ч ing may have occurred can be gleaned from the Janis and Milholland study. 


erein one learns for the first time that the fear appeal used in both studies 


4 Was considerably longer than the minimal appeal. 'The former contained 270 
- Words, while the latter contained only 190. Furthermore, in addition to being 


about 44. per cent longer in word length than the minimal appeal, the fear 
Communication had several fear slides interpolated in the first part of its 
- Présentation. It appears, then, that the strong appeal not only aroused more 
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emotion among audience members than did the minimal appeal, but also trans- 
mitted considerably more information to them. The history of psychology is 
replete with studies showing that the learning of material specific to a com- 
munication is more efficient when additional information is not appended to 
the communication than when such information is appended. Any additional 
material, emotional or nonemotional in tone, should interfere with the learning 
of the minimal fear appeal content. 

If we assume that differential learning did in fact occur, then it would 
be easy to rationalize the results of the Janis and Feshbach study. ‘The group 
exposed to the slides less well learned the communicator's recommendations. 
Therefore, they less often conformed to his suggestions. For the same reason, 
they were more apt to be swayed by counterpropaganda than were subjects in 
the minimal appeal group. 

"There are other reasons to avoid overgeneralization from the Janis and 
Feshbach study, but these are not due to any presumed weaknesses of their 
study. Some of these criticisms will be noted in the discussion. 


D. ANOTHER EXPERIMENT OF THE EFFECTS OF 
FEAR APPEALS ON LEARNING 


Duke (2) advanced some of the criticism above, and reported an experiment 
bearing on the interpretations of the Janis and Feshbach experiment. A de- 
tailed resume of that study is recorded here. 


1. Method 


Subjects were 393 male University of North Carolina students, members of 
13 introductory hygiene classes. There were six experimental groups and one 
control group. The 363 subjects of the experimental groups all heard a stan- 
dard tape about syphilis, but the conditions under which this tape was heard 
varied in six ways. These treatments will be described after the three commu- 
nications used in the experiment have been identified. 

a. The three communications. The standard communication was а non- 
emotional taped presentation about syphilis, its history, causes, and prevention. 
All but the control group of 30 subjects heard the standard communication. 

The fear communication was a tape and slide presentation about pathology 
resulting from untreated syphilis. Groups exposed to the fear communication 
saw 10 color slides vividly portraying the gory consequences of syphilis in the 
advanced stages. The slides showed nude bodies with ugly large sores, ulcer- 
ated vulvas, chancrous penises, and elephantiasis-like limbs. While the slides 
were being shown, the taped voice continually reminded the audience that 


J. D. DUKE 75 


these consequences might happen to them if they did not learn what they should 
know about syphilis. That the fear appeal was emotion arousing was evident 
in the behavior of audiences exposed to this communication. They watched in 
hushed silence and showed many signs of tension release at the end of the 
communication, such as talking, wisecracking, and restless movement in their 
seats. This sort of behavior was not manifest in groups exposed to the other 
communications. 

The neutral communication consisted of a taped lecture about venereal 
disease and antibiotic research using animal subjects. The neutral and fear 
presentations required about the same amount of time, but there was no way 
to assess whether or not the two communications transmitted comparable 
amounts of information. 

A description of the slides and a verbatim reproduction of the three taped 
communications can be found in the appendices of Duke (2). 

b. The six experimental groups. Various combinations of the three com- 
munications were used to create the six experimental treatments of this study. 
"These are described below: 

One group of subjects heard only the standard communication. By analogy, 
this group heard an appeal much like the minimal appeal group in the Janis 
and Feshbach experiment. 

А second group heard the fear communication approximately 48 hours before 
they heard the standard communication about syphilis. To facilitate the illusion 
that the two communications belonged together, the experimenter pretended 
that the tape machine broke down right after the fear communication had been 
Presented. The student classes receiving this treatment were told that the 

fam would have to be finished at the next class meeting" 48 hours later. 

is treatment was like that received by strong fear subjects in the Janis 

and Feshbach experiment except that the fear slides and tape preceded the 
Standard communication by 48 hours. 

A third and fourth group heard the same standard communication, but 

card either the fear or the neutral communications immediately before hearing 
Пё standard communication. Should differential learning occur among the 
‘IX groups, comparison of these two groups would help determine why the 

Ces occur, 
к For the fifth and sixth groups, the standard communication was deliberately 
interrupted in six preset places, At each interruption, one slide was flashed 
lY on a screen in front of the audience. For the fifth group, the six slides 
Were the same as those used in the fear communication. For the sixth group, 
only one slide was used for all six interruptions. It was a neutral slide entitled 
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in bold letters, “Facts About Syphilis.” For both the fifth and the sixth groups, 
the interruptions were rationalized to the groups as an effort to remind them 
about the “seriousness of syphilis.” 

One final group, another introductory hygiene class, served as a control 
group. This group did not hear any communication about syphilis, but the 
group did take the test described below. 


2. The Test of Learning 


A 33-item, five alternative, multiple choice test of learning the material 
contained in the standard communication was given to all experimental groups 
on two occasions, once immediately after exposure to the standard communi- 
cation, and again two weeks later. The control group took the test only once. 
Test-retest reliability was -+.79. A copy of this test is also included in the 
Appendices of Duke (2). 


3. Results and Discussion 


Table 1 records the test performance averages for the seven groups. The 
marked differences between the control group and the six experimental groups 
verify that the standard communication communicated significant amounts of 
information about syphilis. The control group averaged missing 17 items, while 
no experimental group averaged missing more than eight items on the test. 

The observed differences among the means of the experimental groups sup- 
port the hypothesis of differential learning. The two groups best learning and 
retaining the information contained in the standard communication were the 
groups that heard only the standard communication on the day they were 
tested. The fact that one of the groups had heard the fear communication 48 
hours earlier did not affect how well it subsequently learned the standard 
communication material. 

The two groups who heard an additional communication just prior fo 
exposure to the standard communication did not learn the test information 
as well as the groups with no preceding communication. Kind of preceding 
communication once again did not seem to affect the results. 

Finally, Table 1 shows that the poorest performances occurred only in the 
two groups in which the standard communication itself was interrupted. 
Again, whether the interpolated slides were emotion arousing or not had no 
bearing on the amount of interference in learning. Both kinds of interruption 
significantly disrupted learning. 

Dr. Harold McCurdy, faculty adviser of the writer at the time of this 
study, suggested an analysis by which 'Table 1 results can be integrated. Since 
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TABLE 1 
Test AND RETEST ERROR SCORES ON THE 33-1ТЕМ TEST 
Test Retest 
No. Mean No. Mean 
Group of $s errors SDs of 5з errors SDs 


Group 1: standard 
communication alone 67 4.67 2.88 66 6.14 3.75 


Group 2: fear 
communication 
48 hours before 


standard 57 4.98 3.70 55 5.75 3.65 
Group 3: neutral 

communication 

immediately before 

standard 80 5.29 3.92 78 6.64 3.87 


Group 4: fear 
communication 
immediately before 
standard 74 5.77 344 70 10 paci 


Group 5: standard 
interrupted 
by fear slides 57 646 3.78 48 8.58 5.68 


Group 6: standard 


interrupted z 
by neutral slide 29 7.65 402 26 792 420 


Group 7: control—not 
exposed to any 
communication 30 17.00 6.20 


it is contended that the emotional dimensions of the communications were 
not responsible for the differential learning effects, the emotional variable can 
be ignored altogether. If the six experimental groups are classified according 
to the degree of distraction they endured, three pairs of groups emerge. The 
group that heard the standard communication alone and the group that heard 
the fear communication 48 hours before exposure to the standard communi- 
cation were the “least distracted” groups. The two groups that heard fear 
or neutral communications just prior to the standard appeal were inter- 
mediately distracted,” while the two groups in which the Seres communi- 
cation was actually interrupted were the "most distracted groups. "Table 2 
Shows that learning of the standard material was significantly and inversely 
related to the amount of distraction (either by additional material or by inter- 
Tuption) accompanying the standard communication. The computed f-scores 
establish that learning was best in the groups hearing only the standard com- 
munication on the day they were tested, and poorest for the two groups in 
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which the standard communication was actually interrupted. Appending ad- 
ditional communications to the standard communication was intermediately 
disturbing. 


TABLE 2 
EFFECT oF DISTRACTION ON LEARNING 
Least Intermediately Most 
distracted distracted distracted 
Variable groups groups groups 
No. of Ss 124 153 86 
Mean errors 4.81 5.83 749 
SDs 3.26 3.56 3.93 


Soh Ee UNE ee en eee ee ee 
Note: t-ratio comparisons are as follows: Least vs. Intermediate: t = 2.48 (5 < .02); 
Intermediate vs. Most: £ = 3.24 (p < .001) ; and Least vs. Most: £ > 6.00 (p < .001). 


Just for the record, Table 1 also records retest scores for the six experimental 
groups. As expected, retention scores were poorer than learning scores, but in 
general the retention scores roughly paralleled initial learning scores. If the 
fear appeal caused repression or defensive avoidance behavior among the sub- 
jects, this result was not detectable from the data collected. Indeed, the best 
observed retention score occurred for the group that was exposed to the 
fear communication 48 hours before they heard the standard communication. 
It is quite clear from the data that the provocative slide material in no way 
disturbed their learning and retention. 

In summary, the experiment by Duke could find no differential learning 
effects attributable to the fear communication or slide material per se. On the 
other hand, learning and retention scores were adversely affected when addi- 
tional communications preceded the standard communication or when the stan- 
dard communication was interrupted at predetermined times. The latter more 
seriously affected learning and retention than did the former. 


E. Discussion: THE Erricacy or FEAR APPEALS 


Evidence within the account of their experiment and from an experiment 
by Duke questions the validity of interpretations advanced by Janis and Fesh- 
bach. In dispute is their assumption that differential learning did not occur 
among their experimental groups. Presumptive evidence suggests that their 
strong appeal, due to inclusion of threat references, was longer than the 
minimal appeal, therefore distracting the strong appeal group from optimal 
learning of factual ‘information contained in all three communications. 
experiment by Duke supports this analysis, for additional material, whether 
emotional or not, was found to disrupt the learning of information contained 
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a standard communication. The differential conformity reported in 
lanis and Feshbach study may be an artifact of differential learning. At 
| their interpretations are uncertain and invite more definitive research 
in the efficacy of fear appeals. 
It should be pointed out that another recent study also questions the Janis 
and Feshbach findings. Berkowitz and Cottingham (1) found that a strong 
appeal induced more favorable attitudes toward automobile safety belts 
n did a weak appeal. Paradoxically, the strong appeal was more effective 
a group for whom the use of safety belts should be less relevant: namely, 
ng students who made less frequent use of automobiles in their daily 
The Wisconsin investigators attributed the greater efficacy of the strong 
to the fact that it enhanced interest value of a communication which 
Wise was of low interest to the audience. 
even if subsequent research bears out Janis and Feshbach’s original 
about the inefficacy of fear appeals, application possibilities are never- 
limited. All the studies so far in this area have made use of captive 
ces. The somewhat coerced behavior of students in the classroom may 
y different from the relatively free behavior of mass media audiences. 
rkowitz and Cottingham’s results suggest, emotional material may 
e interest value, thus helping the mass media propagandist to attract 


le it is true that political propaganda, bond issues, fluoridation proposals, 
the like, will be lively contested by counterpropaganda, there are many 
г propaganda campaigns highly unlikely to be opposed by counterpropa- 
For examples in our society, who would oppose public service messages 
driving safely, supporting the Red Cross, learning the warning signs of 
т, going to church, registering at election time, getting a chest X-ray, 
The use of emotional or fear appeals may be useful in one situation but 
in the other. It is quite reasonable that monopoly propagandists might 
ize their returns by using a healthy variety of appeals. 

he use of fear appeals may often be opposed on ethical grounds, but it 
ins to be empirically demonstrated under what conditions fear appeals aid, 
or have no bearing on the outcome of propaganda campaigns. 


+ 
F. SUMMARY 


\ 1953 experiment by Janis and Feshbach concluded that fear appeals were 
Successful than nonemotional appeals in getting audiences to adopt the 


» 


andist’s recommendations. The Janis and Feshbach experiment was 
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re-evaluated and a methodological criticism was advanced. Several kinds of 
information suggest that Janis and Feshbach may have failed to detect real 
differences in learning among their three treatment groups. They reported 
giving a 23-item test to their groups, and they accepted the null hypothesis 
of no difference in learning among the groups; but data were lacking, and 
information contained in the subsequent Janis and Milholland experiment sug- 
gested that they may have made the Type II error in accepting the null hy- 
pothesis. Due to the inclusion of threat references and the interpolation of 
fear-arousing slides, their fear appeal was much longer than their less emotional 
appeals. This fact alone would predict better learning in the group exposed 
to the shorter appeal. 

An experiment by Duke did find differential learning effects as a function 
of communication length. Learning and retention of nonemotional facts about 
syphilis were significantly disturbed under the following conditions: (a) if a 
standard communication was interrupted by either emotional or nonemotional 
additions, and (b) if the standard communication was preceded by either 
emotional or nonemotional communications. It is important to note that the 
emotional content of the communications for both situations had no detectable 
influence on the results. 

Since the Janis and Feshbach study did not control for the various lengths 
of the three communications, the conclusion that fear appeals are inefficacious 
is equivocal. Janis and Feshbach have not convincingly demonstrated that their 
results were not an artifact of differential learning. 
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THE EMERGING SELF-CONCEPT IN RELATION TO SELECT 
VARIABLES OF SECONDARY SCHOOL STUDENTS* 


The University of Georgia 


WirLiAM F. WHITE, Harry E. ANDERSON, JR., AND HAYNES CRYDER 


А. INTRODUCTION 


As the adolescent advances into adulthood, the role of the self becomes in- 
creasingly important in the selection and regulation of behavior. Concerned 
with problems of identity, relationships to parents, and association with peers, 
the adolescent high school youth is fixed at a crucial point of planning his 
future and establishing his independent self in a complex world. “Until youth 
begins to plan, the sense of self is not complete” (1, pp. 126-127). 

Allport hypothesized seven aspects of “becoming,” or sense of self, unified 
under the single phenomenon of the “proprium.” These aspects were reported 
as "states of self relevance we feel . . . they compose the means felt and 
known . . ." (1, p. 195). Not remaining static, these aspects continue to emerge 
and become a sense of self, expanding with experiences as one's circle of par- 
ticipation becomes larger. In early infancy, according to Allport's theory, 
three distinct stages develop and increase with age: (a) a Sense of Bodily 
Self, (b) a Sense of Continuing Self-Identity, and (c) a Sense of Self-Esteem, 
or Pride. During ages 4 to 6, a Sense of Self-Extension and a Sense of Self- 
Image are theorized. Expanding from middle childhood to puberty, ages 6 to 
12, a Sense of Self as a Rational Coper and as a Propriate Striver now emerges. 
These levels of “becoming” are not mutually exclusive but expand and develop 
ina coexisting pattern. The question of how this pattern of evolving sense of 
self is reflected in the growth and development of adolescents during high 
school is open for empirical study. 

The problem can be generally stated. Does the self-concept tend to modify 
and mature as the individual progresses from early adolescence to middle 
adolescence? Specifically, do certain characteristics of the self-image tend to 
* more prevalent at different ages and at different stages of the high school 
Curriculum? Does sex, formal education of parents, or extracurricular activities 
Play a significant role in the development of the self-concept during the transi- 
Чоп from freshman to senior year in high school? The purpose of the present 
cem — 

1946 eeived in the Editorial Office, Provincetown, Massachusetts, on January 25, 
* vobyright, 1967, by The Journal Press. 
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study is to examine the relationship of the aforesaid variables to the emerging: 
self-concept within the framework of Allport’s theory. 


B. METHOD 
1. Subjects 
The subjects were secondary school students from a public school in 
State of Georgia. Random samples of 60 white students were selected from 
each of four grade levels, ninth through twelfth. Lack of complete data гез 
ported by 15 students necessitated the extraction of these students from t 
original samples. Therefore the analyses of data for this study are based on 
total sample of 91 boys and 134 girls. 


2. Procedure 


Based upon Allport's (1) theoretical model of the evolving sense of self, 
McKinney (6) developed a self-concept inventory, The McKinney Sentence 
Completion Blank (MSCB), which was much less structured than Q-sorts 
and similar methods frequently used in the investigations of the self-concept. 
Three examples from the MSCB are the following: 

1. My main interest 
2. I look forward to 
25. I am the kind of person who 


The MSCB was administered to the four groups of subjects, simultaneously; 
in a large auditorium. Subjects were instructed to complete the 46 items 
of the MSCB, freely stating their attitudes and feelings. It was emphasized 
that there were no right or wrong answers, and that each student would be 
allowed as much time as needed to complete the inventory. b 

The 46 responses of the MSCB were classified by three judges into appro- 
priate categories related to the various levels of Allport's evolving sense of 
self: (a) Sense of Bodily Self, (5) Sense of Continuing Self-Identity, (с) 
Sense of Self-Esteem, Pride, (4) Sense of Self-Extension, (e) Sense of Self 
Image, (f) Sense of Self as a Rational Coper, and (g) Sense of Self as 
a Propriate Striver. Responses that could not be classified into one of the seven 
categories were placed into an eighth, Unclassified Category. All three judges 
agreed on 88 per cent of the item classifications. When at least two judges 
were not in agreement on a classification of a particular item, the item wa 
placed in the Unclassified Category. 


3. Analysis 


For each individual, the numbers of items in the MSCB classified in each 
of the eight categories were changed to indicate percentages. The six inde 
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pendent variables of major interest were as follows: sex, grade level (GL), 
number of Siblings (SIB), extracurricular activities (XACT), mother’s 
education (ME), and father’s education (FE). The effects of the six variables 
are tested through an F-test; regression weights are determined for random 
variables and fitted constants for fixed variables. Somewhat arbitrarily, GL 
and SEX were designated as fixed variables ; the remaining four (SIB, XACT, 
ME, and FE) as random variables. 


C. RESULTS 


So few items were placed in Category 1, Bodily Self, that analysis of vari- 
ables related to this category was not possible. For the 46-item MSCB across 
the entire sample of 225 students, a total of only 21 items were identified, or 
classified, in Category 1. Category 1, then, was omitted from further study. 

The F-ratios, tests of significance for each variable in the seven analyses, 
are presented in Table 1. Two of the random variables arc significant vari- 
ously in three analyses, while GL and SEX are significant in five analyses. 
The remaining two random variables, FE and SIB, are insignificant in all 
analyses. 

TABLE 1 


F-Ratios FOR EACH VARIABLE IN THE LEAsT-SQUARES ANALYSIS OF 


VarIANCE FOR EACH CATEGORY 


Category 
Variable 2 3 4 7 8 
que 10.385**  12,569** 
evel 5233* 5.991% 6.257% 0.514 0.185 . 
Sex a900 1421 65.7308" 9.1248 1.616 18.690* $6.184** 
d Кшз 0.909 0.156 0412 0.199 6.611 0.158 0.071 
ather’s 
education 3.019 0.023 0.650 — 3.688 3.092 0.175 0.285 
other's 


education 0.435 0.031 5.002* 4.540* 7.996** 1.985 0.030 
xtra- 
curricular 


activities 1.273 1.181 11.012#5* 6.510% 1.986 0.658 5.324* 


ен Significant at the .05 level. 
Significant at the .01 level. 


The regression weights for ME and XACT, significant at least at the .05 
level, are presented in Table 2; ME is negatively related to Categories 4 
and 6 (Sense of Self-Extension and Sense of Self as a Rational Coper). A 
Positive relationship is seen between Category 5 (Self-Image) and both 
variables, ME and XACT. The lower the mother’s education, the more 
Students made responses that were classified in the Self-Extension and Ra- 
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tional Coper Categories; the higher the mother’s education, the more students’ 
responses were classified in the Self-Image Category. Similarly, the number 
of extracurricular activities is positively related to the number of responses 
classified in the Self-Extension and Self-Image Categories, and negatively re- 
lated to the number of items that could not be clearly classified. 


TABLE 2 
SIGNIFICANT REGRESSION WEIGHTS FOR RANDOM VARIABLES IN THE ANALYSES 
Category 
Variable + 5 6 $ 

Mother's 

education —2137 3447 —5178 - 
Extracurricular 

activities 4052 7091 Е —.2191 


ja EE пут ee ee 


The means for the fixed variables, indicated as significant in Table 1, are 
shown in Table 3. Apparently, female students tend to make relatively more 
responses than do males in the categories of Continuing Self-Identity, Self- 
Extension, and Propriate Striver, while the male students make more responses 
than do females in Self-Image and Unclassified categories. The pattern of 
means across grade levels is, however, irregular. 


TABLE 3 
PERCENTAGE MEANS FOR FIXED VARIABLES THAT WERE SIGNIFICANT IN THE ANALYSES 
Category 
Variable 2 3 4 5 7 8 
Grade level 
9 1.6084 3.9434 8.5444 — 12271 4.5939 
10 1.1571 2.5493 10.2565 — 2.7301 3.5435 
11 1.5297 3.5515 9.8346 — 1.7845 2.5420 
3 12 1.2892 3.3190 8.7733 — 2.2031 2.7772 
x 
M 1.2597 — 7.9564 11.5703 1.5371 4.3677 


F 1.5325 — 10.7580 9.2031 2.4353 2.3605 


D. Discussion 


When statements of the MSCB were examined, there were so few responses 
in Category 1 that comparisons were not possible. Lack of statements which 
express bodily or physical awareness may have been an index of the repressive 
influence of the culture. Failure of the students to respond to bodily charac- 
teristics, sexual drives, or organic sensations in any of their projected responses 
points most vividly to some control over the open discussion of these matters. 
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The absence of any measurable cathexes toward bodily self is, indeed, sur- 
prising in the light of the rapid physical changes that take place at pubescence 
during carly adolescence. The vacuity of concern for sex mores and 
problems, however, may reflect the inability of the MSCB to elicit statements 
vf bodily self. 
- Data supported the hypothesis that the various senses of self continue to 
change and develop during the adolescent years, although the pattern of devel- 
‘opment is somewhat irregular. When grade level was examined as a mean- 
jngful variable in relation to self-concept, four of the emerging stages of self 
‘showed significant modifications from freshman to senior years: (а) Continuing 
Sclf-Identity, (b) Self-Esteem, (с) Sense of Self-Extension, and (4) Sense of 
Self as a Propriate Striver. It is noteworthy that the Unclassified Category 
‘contained a significantly different percentage of responses throughout the grade 
levels; in general, older students reported more clearly classified responses 
than did younger students. 
_ Continuing Self-Identity, Category 2, reflects a sense of organic continuity 
of the neuromuscular system. Basically, responses indicate a special distinction 
between the self and others, The identity, reported here, could be an object, 
place, or a group, which the subject uses to mirror himself. When the high 
‘school boys and girls were viewed along the dimension of Continuing Self- 
Identity, significant differences were obtained for grade level and sex. In 
general, freshman and juniors show more concern with this stage of devel- 
opment than do sophomores or seniors, and females have more responses in 
this category than do males. Results of the responses in the Category of Self- 
Identity are consistent with the feelings of many authors (1, 2, 4); in that 
ће searching for identity is acute during adolescence. 3 
When responses having reference to the Sense of Self-Esteem or Pride 
(Category 3) were examined, modification of the self-concept in relation to 
- grade level was found to be significant. The feeling of importance and the 
need for autonomy seemed to decrease from freshman to senior year with the 
heaviest drop in self-responses during the sophomore year. The strong need 
for independence appeared to be most intense during the freshman. year. A 
_ Rradual decrement in this intensity was noted in the junior and senior years, 
but a significant drop was reported in the sophomore year. It can logically be 
inferred that elementary boys and girls enter the more prestigious and re- 
spected domain of the freshman high school curriculum with a strong desire 
Bo lead, to be mature, to demonstrate individual potential. With the con- 
tinuance of high school participation in the sophomore year, the student feels 
need to belong, to be a member of the team, to be a loyal participant 
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in school plays, clubs, and projects. Some of the strong drive for independence 
characteristic of the initiating freshman may be modified by peer conformity, 
As the junior and senior years approach, the spirit of high school leadership, 
of being a captain of a team, an officer in a club, a director of scholastic activ- 
ities, teases out the spirit of independence and a sense of pride. Thus, the 
Sense of Self-Esteem can be viewed as maintaining a pattern of arsis and thesis 
and then arsis, emerging with relative stability as the strength and maturity 
of high school accomplishments are obtained. None of the other variables 
placed a significant role in developing a Sense of Self-Esteem or Pride. 

Allport (1, p. 283) hypothesized that the limits of the self were “rapidly 
extended” during adolescence. The Sense of Self-Extension, Category 4, ap- 
peared to discriminate high-school students by grade and by sex. The fact 
that sophomore boys and girls revealed higher mean responses than did any of 
the other grades may point to the marked increase in activities for students 
in this class. The more accepting attitude toward dates, dancing, sports, clubs, 
etc., undoubtedly contributed to the interest in group activities. Mean scores 
indicated a rapid extension of "outside interests" and, then, a leveling off and 
stabilization of these cathexes and identifications throughout the remainder 
of the high school cycle. 

Two further variables that have a determining function in student responses 
of Self-Extension are the formal education of their mothers and the extra- 
curricular activities in school. More advanced formal training of mothers 
might indicate an increased need for achievement. More participation in 
extracurricular activities by students may well be determined by their mothers’ 
high need to be successful and socially approved. Sons and daughters condi- 
tioned in the home to a high need for achievement and social desirability may 
strongly extend themselves to various extra activities at school. 

Results indicate that all the adolescents in the sample were concerned with 
a Sense of Self-Image, yet no marked difference was noted among the four 
grade levels. Girls were observed to be significantly different from boys in 
reference to their present ability, status, and roles. Apparently, the adolescent 
boy becomes more positive and gains more confidence as he continues his 
experiences of high school. Perhaps sports, as much as any subject, anchor his 
interests and help the male student to maintain a satisfactory and healthy 
self-image. The female adolescent’s self-image undergoes continual development 
and becomes gradually more cathected toward self-adornment and attractive- 
ness. Her anxious concern to be chosen, to possess the attentions of the op- 
posite sex, may at times arouse dissatisfaction and negative attitudes toward 
the image of self. This may account for the lower mean responses for girls; 
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‘may have repressed undo attention to self-image on the paper and pencil 
Mothers’ formal education had a determining influence upon the forma- 
of the adolescent's self-image. Family status, which is highly correlated 
th degrees and years of schooling, would enhance or detract from the position 
the high school boy or girl enjoys in school. It was not surprising that 
` participation in extracurricular activities had an even more important effect 
upon the development of a sense of self-image. Within these activities, students 
possess a feeling of belonging and develop increased ego strength. 
When the results were examined relative to the Sense of Self as a Rational 
- Coper, it was noteworthy that only the formal schooling of the mother demon- 
ted any significant difference. It is easy to interpret the relationship between 
educative backgrounds of parents and logical processes of their chil- 
but that no other variable maintained any significant determining influ- 
се in such coping behavior makes for difficult discussion. At least one infer- 
can be made. As Piaget (5) suggested, the formal logic period of thinking 
at 12 years of age and, as Allport maintains that rational coping is 
entative of the twelfth year of development, it may be that rational 
is stage dependent at this adolescent period of growth from 12 to 17 
Manner and styles of rationally coping with reality may differ, but the 
t process of dealing reasonably with environmental presses may be as 
onal in one grade level as in another, and may be at least utilized by 
sexes in the maturing and developing cycle of the adolescents. Types and 
of reasoning may differ with sex and formal training, but the human 
to think rationally remains relatively stable following its incipiency at 


The Sense of Self as a Propriate Striver involves planning as important to 
а complete sense of self. Vocational plans of the adolescent are a function of 
ability to delay gratification of immediate needs. Although the facility 
ith which a boy or girl can forego an immediate reward in order to gain a 
ter reward is highly related to age, sex, culture, and family structure, 
the formal education of the mother and father were not specific deter- 
ts of propriate striving in the youth examined here. Among the variables 
d in this study, planning for long range, vocational, or final goals 
red to be only a function of sex and grade level in school. The high 
n responses indicated that sophomores and seniors were more concerned 
ut propriate striving than students in the other classes. In each high school 
girls reported a larger number of statements reflecting a Sense of Self as 
Topriate Striver than did boys. 

t should be noted that there was a significant difference among the four 
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classes in the number of unclassified responses. As grade level increased, the 
mean responses of unclassified sentences decreased. Perhaps more definitiveness, 
more maturity, more self-assurance of older students brought about a greater 
quantity and clearer identification of MSCB responses. Boys showed a greater 
mean response for Category 8 than did girls. It appears that female students 
are more willing to make statements about their self-concepts and can define 
them with more clarity than their male classmates. The negative regression 
weights for responses in Category 8 (Table 2) indicate that students who are 
more outgoing and pursue a greater number of extracurricular activities are 
more willing to describe their self-concept feelings and do so with greater 
meaning and clarification than students who are minimally active in extra 
activities at school. 


E. SUMMARY 


An attempt was made to measure the evolving sense of self, as defined by 
the McKinney Sentence Completion Blank, of 225 students in grades 9 
through 12. Developmental changes in grade level were observed’ on four 
aspects of the self: Sense of Continuing Self-Identity, Sense of Self-Esteem, 
or Pride, Sense of Self-Extension, and the Sense of Self as a Propriate Striver. 
Relations of the adolescent self-concept were also compared with sex, fathers’ 
and mothers’ formal education, and extracurricular activities of the students. 
Data supported the use of the McKinney Sentence Completion Blank to eval- 
uate the modification of the self-concept through the high school years. 
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VOLITION AND ATTRACTION IN EVERYDAY LIFE* ! 


The Ohio State University 


Timotuy C. Brock AND LEE ALAN BECKER 


А. PROBLEM 


Volition, the person's feeling that he has some control about whether or 
not to put himself into a situation, has increased in theoretical interest fol- 
lowing the studies of cognitive dissonance by Brehm and Cohen (1). The 
present study attempted further delineation of “volition” by examining the 
relationship between free choice and the attractiveness of everyday activities. 

Many of the forces that seem to control the sentiment-changing potency of 
exposure to inconsistency—degree of effort, of reward, of justification, and 
so forth—may act through modification of volition. For example, the person 
exposed to counter-propaganda for minimal justification (3) may feel the 
exposure was his own choice to a greater degree than someone exposed for 
ample justification. To follow Lewin's (9, p. 267) distinction, when induced 
forces propelling a person toward confrontation by inconsistency are weak 
or absent, the person may come to feel that his own forces push him into the 
dissonance-arousing situation. It is possible that the variables which govern 
magnitude of dissonance may not act at all if they do not act through variation 


in volition (2). 


The intervening role of volition may be illustrated from an experiment by 
Cohen (5) in which persons were offered differing magnitudes of small reward 
for espousing a position opposed to their own. As predicted from dissonance 
theory, attitude toward the discrepant position increased in favorableness as 
Teward for espousing the position decreased. However, ancillary data showed 
that feeling of volition also increased as reward decreased. Hence, differing 
Tewards for compliance in discrepant behavior may induce dissonance by 
means of the effects those rewards have on magnitude of volition, That con- 
comitant variation in volition may be necessary was suggested by Rosenberg's 
(10) recent study. Rosenberg not only failed to replicate Cohen’s (5) effect 
of teward on attitude change, but he also failed to reproduce Cohen’s effect 


of reward on volition. 
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The Brehm-Cohen (1) casting of the volition variable is a major revision 
of the original statement of dissonance theory (7). The original formulation 
would predict that dissonance increases with increasing relative attractiveness 
of the alternatives in a choice. However, according to Brehm and Cohen (1), 
if the person were given a large reward for making a choice, volition would 
be low and so, presumably, would be the net dissonance. This analysis sug- 
gests that volition may affect dissonance independently of the ratio of dis- 
sonant to consonant cognitions and that choice, the selection of one alternative 
with relinquishment of another, is conceptually and operationally distinguish- 
able from volition. 

To test the above possibilities in a more crucial fashion, Cohen and Brehm 
(6, р. 205) attempted to produce dissonance by creating variations in volition 
in a way that goes “counter to the theoretically expected relationship between 
force and attitude change.” Cohen and Brehm had pledges sign up for a long, 
boring task under conditions of High Coercion (“if you don’t sign up we'll 
get you thrown out of the fraternity”) and Low Coercion (“if you don’t 
sign up we'll see that you get some extra hours of pledge duty"). They found 
that both volition and perceived threat were higher under High Coercion than 
under Low Coercion. The greater the threat, the less should be the dissonance 
and consequent favorable evaluation of the boring task. The greater the voli- 
tion, on the other hand, the greater may be the liking for the discrepant act. 
The results supported the latter alternative and thus showed how variation 
in volition is sufficiently powerful to countervail the presumed dissonance- 
decreasing effects of high felt threat and annoyance. Cohen and Brehm (6, 
р. 209) concluded, “оп the whole, then, the data support the proposition 
that volition can effect the magnitude of dissonance independently of the ratio 
of dissonant to consonant cognitions.” 

The reliability of the differences in the above experiment was unimpressive 
and the experiment was not specifically designed to test the distinctive role 
of volition in dissonance theory. Consequently, Leventhal and Brehm (8, 
р. 214) undertook a more direct approach: “It is our contention that the 
amount of dissonance created by a choice between two alternatives is a function 
not only of the relative attractiveness of the alternatives themselves but also 
of the degree of volition involved in the person's having to make a choice.” 
Volition about being in an experiment was varied indirectly and then the 
subject was acquainted with several tasks, evaluated the tasks, chose опе of 
the tasks, and then revaluated the tasks. Dissonance reduction, increased 
liking for the chosen task and decreased liking for the unchosen task, was 
greater in the presumed high volition condition than in the presumed low 
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volition condition. This pioneering experiment, therefore, independently ma- 
nipulated volition and choice and showed that differences in volition can make 
a difference in subsequent postchoice responses. Unfortunately there were 
complexities and ambiguities in the procedure and data. It was possible that 
the tasks were not really disliked (and therefore not dissonance-arousing) and 
that the mechanism for varying volition, amount of grade credit points offered, 
artifactually affected the initial liking for the tasks. 

The paradoxical effect in the Cohen and Brehm (6) study, volition in- 
creasing with coercion, and the ambiguities of the Leventhal and Brehm study 
(8) left important questions about volition and dissonance without a definitive 
reply. Consequently, an experiment was conducted in which volition was pitted 
against justification to determine which variable was prepotent in dissonance 
arousal. The results of that study (2) showed that, for all three levels of 
justification, enhancement of a boring task increased in proportion to magni- 
tude of volition. A relationship between lower justification and higher liking 
for the task was obtained, however, only under a very low volition condition. 
The results provided strong support for the proposal that volition is the central 
mechanism in dissonance arousal, 

The present approach examined persons in their everyday setting to see 
how, in real life, volition, felt coercion, and activity attractiveness are inter- 
related, An instrument, described in detail below, was designed to measure the 
attractiveness of a given everyday activity, the number of perceived alternative 
activities, and the choice and obligation felt by the person as he embarked 
upon the given activity. The first hypothesis was that expression of volition 
will increase, and expression of obligation decrease, as the number of reported 
alternative activities increases. To illustrate, a person who chooses to bowl, 
and in so doing rejects movies, card-playing, reading, dancing, and watching 
television, will perceive he had more choice in bowling than a person who 
chose to bowl and saw reading as his only alternative. The latter person will 
feel more obliged to bowl than the former because the paucity of alternatives 
increases feeling of constraint. The hypothesis implies an inverse relationship 
between volition and felt coercion, although that relationship was not ob- 
tained by Cohen and Brehm (6) and only irregularly by Brock and Buss (4). 

The second hypothesis was that enjoyment of the activity will be directly 
correlated with feeling of volition and inversely correlated with „sense of 
obligation. The second hypothesis was in accord with laboratory findings sum- 

` marized above. The third hypothesis was that the predicted relationships will 
be greater in magnitude for persons above the median than for persons below 
the median in number of reported alternative activities. Two assumptions were 
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involved. First, the relationships specified in the first and second hypotheses 
would be stronger, the greater the psychological commitment to the activity. 
Second, psychological commitment to an activity increases as the number of 
rejected alternative activities increases. The second assumption is based on the 
presumed effect of alternatives on volition and the augmentative effect of 
volition on psychological commitment. 


B. METHOD 


Fifty-three men and 93 women in introductory psychology sections at 
Iowa State University completed a three-part questionnaire entitled “Memory 
for Personal Experience.” The instructions for Part I were: 

As part of a larger project concerned with ability to remember personal 
experiences, this questionnaire seeks to discover how well people can 
remember what they did the day before and some of their feelings about 
their activities. Some time periods for yesterday are listed below. In no 
more than 10 words describe what you did during each time period 
yesterday, If you did more than one thing, list only your major activity. 


Ten half-hour periods were listed vertically with space for the respondents’ 
description next to each time period. The periods were: 11 {00 to 11:30 AM, 
noon to 12:30 PM, 1:00 to 1:30 PM, 3:00 to 3:30 PM, 3:30 to 4:00 PM, 
4:00 to 4:30 PM, 4:30 to 5:00 PM, 7:00 to 7:30 PM, 7:30 to 8:00 PM, 
and 8:00 to 8:30 PM. The instructions for Part II were: 


People are often aware of alternative courses of action. Instead of going 
to Iowa State University a student might have gone to State University 
of Iowa or he might not have gone to college at all. Instead of going to 
the movies for relaxation a student might feel that he could just as well 
read a novel or chat with friends or go bowling or take a nap. When a 
person is doing one thing, he may be aware of a few things or many 
things that he might just as well be doing. For each time period in Part I 
list, in a phrase or two, the other things you might just as well have done 
during that particular time period. 


The 10 half-hour periods were listed again with space for the respondents t9 
mention “things I could just as well have been doing.” 

Part III instructions were: “Answer three questions concerning your actual 
activity in each time period. Draw a line through each scale to indicate yout 
answer." The 10 time periods were listed again and each period was accom- 
panied by three 50-point scales with each tenth point appropriately labeled. 
The first scale, to measure volition, was “During this time period what I 
did was my choice”; it was accompanied by labels running from “Entirely” 
to “Not at all.” The second scale, “During this time period my feeling of 
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being under obligation to do what I did was,” had labels running from “Сот- 
plete” to “Nonexistent.” The third, to measure attractiveness, was “I enjoyed 
my activity during this time period”; it had labels running from “Completely” 
to “Not at all.” The reported alternatives for each activity in each time-period 
were counted. The three scales were scored from zero (low) to 50 (high). 


C. RESULTS 


The correlations shown in "Fable 1 are average correlations computed from 
the sums of cach subject's covariance and sums of squares. Covariances of the 
six combinations of the variables and sums of squares of cach of the four 
variables were computed for each subject within the 10 time-periods. These 
components were then summed across 146 subjects to form the component sum 
of covariances and sum of sums of squares for the correlations shown in 
Table 1. An alternative procedure would have been to correlate the average 
subject scores. That is, for each subject the average variable score across the 
10 time-periods could have been found and the correlation between the average 
scores for the 146 subjects computed. The method of average correlations is 
presented rather than the method of correlations of average scores because the 


TABLE 1 
AVERAGE CORRELATIONS BETWEEN VOLITION, OBLIGATION, NUMBER OF ALTERNATIVE 
ACTIVITIES, AND ATTRACTIVENESS OF ACTUAL ACTIVITY 


Sex Alternatives 
Total Men Women Few" Many? 
Relationship (146) (53) (93) (70) (76) 
Volition and 
number of 
alternatives 273%* 197 .304** 355** 236* 
ligation and 
number of 
alternatives —.206* 185 217% —313** —315** 
Attractiveness 
and volition 312%* 208 356** 313% ise 
«Attractiveness 
and obligation —311** —219 —342* —287* 330** 
ttractiveness and 
number of 
alternatives 068 074 066 057 076 
olition and 
obligation _agee Tae este | cag до 
Number of cases 146 53 93 70 16 


Note: Ns in parentheses. 
Two-tailed P less than .05. 
Wo-tailed 5 less than .01. н ENS 
elow the median in number of alternative activities. 
bove the median in number of alternative activities. 
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results of the former method are more clearly generalizable to the sample as 
a whole, while the second method indicates relationships due to individual 
differences. In general, the correlations produced by the correlation of average 
scores were in the same direction but weaker than those produced by the 
average correlations method. 

Average correlations based on the total sample, N = 146, are shown in 
the left-hand column of Table 1. The other columns show the same relation- 
ships for men and women, and for men and women below and above their 
respective median in number of considered alternative activities. The mean 
number of alternatives, .80, for few-alternatives men differed quite reliably 
(t = 4.8, df = 51) from the mean score for the many-alternatives men, 1.80. 
Similarly, the mean number of alternatives, .57, for the few-alternatives women 
was reliably (2 = 6.6, df = 91) lower than the mean for the many-alternatives 
women, 1.49. The sex and many-few classifications did not affect mean scores 
for volition, obligation, or attractiveness. 

In support of the first hypothesis, the first and second rows of the first 
column in Table 1 show that the number of considered alternatives was posi- 
tively related to volition and inversely related to obligation. An implication 
of the hypothesis was that volition and obligation would be inversely related. 
The sixth row of column one strongly confirmed that prediction. 

The second hypothesis was that enjoyment of the actual activity would be 
directly related to volition and inversely related to obligation. The correlations 
in the third and fourth rows of the first column in Table 1 provided strong 
support. 

"There were no significant differences in the correlations for men and women. 
Columns two and three of Table 1 show that the correlations are all in the 
directions predicted. The correlations for the women appear stronger than 
those for the men in all but two cases, but there were no significant differences. 

The third hypothesis was that the relationships predicted in hypothesis 1 
and 2 would be stronger under many than under few alternatives. Note that 
opportunity for higher correlations was spuriously increased under many alter- 
natives, since variability in number of alternatives was two to three times 
greater under many than under few alternatives. Although five of the six 
comparisons were in the predicted directions, the differences are small and 
t-tests of the transformed correlations yielded no differences approaching 
significance. 

D. Discussion 


Our waking hours are programs in which we select activities, carry them 
out, select others, execute them, and so on, Present respondents had no difficulty 
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ng their major activity during cach of the specified half-hour periods. 
he fifth row of Table 1 shows clearly that the existence of available alter- 
© activities, by itself, does not increase the attractiveness of the selected 
activity. Only when awareness of alternatives augments volition is attractive- 
ness of the chosen activity also increased. The presence of alternatives increases 
likelihood of volitional investment in the chosen activity. If volition is 
ted, the attractiveness of the chosen activity increases. In sum, attractive- 
i and alternatives are related through their relationships with volition and 
ration. 

‘Although correlational data are always susceptible to multiple interpreta- 
the present data, considered in conjunction with experimental results 
ying similar measures, have shown that relationships uncovered by 
tory experiments on cognitive dissonance (1) may prevail in real life. 
vity, whether intrinsically distasteful or not, is enhanced when volition is 
eased or obligation decreased. This relationship appeared to be the same for 
th men and women. No evidence was found that women “enjoy” obligatory 
tivities more or less than men do. 

An alternative possibility is that respondents reported they felt choice in 
ying out attractive activities and said they were obligated to carry out 
joyable activities. Note that measurement of number of alternatives pre- 
ded measurement of enjoyment and volition. The alternatives variable did 
not influence attractiveness but it was positively associated with volition. From 
is consideration it seemed more likely that the voluntary character of the 


‘activity affected its attractiveness than that their enjoyment led respondents 


say they had much choice in selecting the activity. Unfortunately the data 


The overall correlation of —.47 between volition and obligation was sub- 
‚ but an even higher order of magnitude might have been expected with 
simplistic assumption that volition and obligation are opposite sides of the 
coin, One is reminded of the Brock-Buss (4) experiment in which com- 
ble measures of volition and obligation were employed: the obtained rs 
not approach values statistically different from zero. Hence, perceived 
and perceived constraint may occur together. At least that is what 
Ppeared to happen in the Cohen-Brehm (6) experiment in which High 
ercion produced both high perceived threat and high expression of volition. 
leed, in many of the dissonance experiments (2, 4) the subject is constrained 
Temain in the situation but, at the same time, his sense of volition as he 
in the designated activity can be sufficiently powerful to influence 
uation of the activity. Increased understanding can be expected from 
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research in which constraint and freedom are treated as orthogonal variables 
and separately manipulated. The psychology of volition and, in particular, 
the proposition that variation of volition is possible within an overall context 
of constraint and threat has important implications for persuasion and the im- 
plementation of social changes, such as racial integration and adoption of 
public health measures. 


E. SUMMARY 


Volition, the person's feeling that he has some control about whether or 
not to put himself into a situation, increased in theoretical interest with the 
expansion of research on cognitive dissonance. The present study attempted 
further delineation of “volition” by examining the relationships between voli- 
tion, obligation, and the attractiveness of everyday activities. For each of 
10 half-hour time periods, 146 undergraduates indicated how much they 
enjoyed their major activity during the time period, the degree of volition they 
felt, the degree of obligation, and how many alternative activities they re 
jected. The first hypothesis—as the number of rejected alternatives increases 
volition increases and obligation decreases—was supported. In addition, voli- 
tion and obligation were inversely related. The second hypothesis—enjoyment 
of an activity is directly correlated with feeling of volition and inversely cor- 
related with sense of obligation— was also supported. A third hypothesis, which 
predicted that the above relationships would be stronger when many rather 
than few alternatives were rejected, was not confirmed. 

"The results showed that relationships uncovered by laboratory experiments 
on cognitive dissonance prevail in everyday life. The data also suggested that 
variation of volition is possible within an overall context of constraint. Treat- 
ment of constraint and volition as orthogonal variables was indicated. 
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STUDENTS’ ATTITUDE TOWARD LIVING WITH 
DIFFERENT ETHNIC GROUPS* 


Department of Psychology, University of Karachi, Pakistan 


S. M. Hareez ZAIDI 


A. THE PROBLEM 


With the increasing emphasis on international living, it seems desirable to 
find out how people feel about living with the members of other ethnic groups 
in a relationship of differential spatial contiguities. Such situations are no longer 
rare. It is assumed that observation of behavior in such situations may help in 
understanding the dynamics of interpersonal and intergroup stereotypes and 
prejudices, A large number of studies of social distance and stereotypes of 
different groups have been reported from various countries. Most of these 
studies, however, seem to utilize either the items on the Bogardus Social 
Distance Scale or the check-list method initiated by Katz and Braly (2). The 
present study puts the focus on the residential preferences as indications of 
social distance attitudes, Specifically, it is an attempt to understand the reac- 
tions of university students in Pakistan to situations wherein they are re- 
quired to indicate their residential preferences with members of various ethnic 
groups. 

B. PROCEDURE 

The respondents in this study are 107 male and 110 female students of the 
University of Karachi selected randomly on a 10 per cent sample basis. These 
students are from the university’s graduate population with an average age 
of 21.5 years, The sample represents an elite group whose attitudes may Ф 
be typical of the entire population of Pakistan. However, whether or not 
attitudes represented by the results are typical of the entire population, E 
are still important, since they reflect the thinking of a group which wou 
in future assume a dominant political, social, and educational role in Pakistan 


a he study, though patterned after the Bogardus 
i i tudy, though р: S 
he instrument employed in the study. an ndenis grilling 


Social Distance $ d in its first six it 

cale, referred in 1 7 
ness to accept a member of an ethnic group as а room pat as orare 
in a (college) hostel, as a resident on his street, as a resident in ty, 
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as a domicile (resident) of his country. The seventh item represented a nega- 
tive attitude of not accepting him in any capacity. No list of ethnic groups 
was provided in order to give the respondents the freedom to name their own 
group and to indicate the type of association they desired to establish with 
a member of that group. This was to avoid a situation wherein a respondent is 
forced to place a group under one of the given categories even if he would 
not have thought of that group in a situation of free choice. 


C. ANaLvsis oF RESULTS 


"Tables 1 through 4 incorporate the results of the study. The responses have 
been tabulated in terms of percentages. Table | gives the percentage of male 
and female respondents who have indicated their preference for an ethnic 
group under each item. Since there was no significant difference between 
the preferences of the male and the female respondents, their responses have 
been combined in Table 1. The table indicates that the highest preference is 
given to associating with Pakistanis in all the different situations of residential 
contiguity. The Germans are the second most preferred group. The female 
respondents, however, showed relatively less preference for the Germans when 
compared with the male respondents. Another significant feature of Table 1 
is the relatively higher percentage of negative responses toward Russians, 
Indians, Americans, and the British. This may seem to indicate that, given 
the choice, the respondents would tend not to accept these groups in any 
capacity. The female respondents under this negative category also are slightly 
different from the male respondents in so far as they are relatively less willing 
to live with Indians and Russians. The difference, however, is very slight. 


TABLE 2 
DEGREE OF AFFINITY SHOWN BY MALE AND FEMALE RESPONDENTS 
Affinity ratio 

National group Males Females 
Pakistani 2; ee 
German pert p 
American ae 22 
British v ^ 
Indian А a 
ee 2s 30 
Japanese ph e 
French Е 3 
Turk n x 
Iranian H e 
Chinese ти ЕЯ 


. Note: Rank difference correlation (rho) between the male and female Affinity Ratios 


1% 714, significant at .01 level of confidence. 
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Table 2 indicates the degree of affinity in terms of closer residential амен 
ciation with each ethnic group. The higher figures indicate greater degree of 
affinity. The detailed justification for using Affinity Ratio as a measure іа 
place of the Average Social Distance has been discussed elsewhere (4). Briefiy, 
the method of calculating the Affinity Ratio is as follows: the first item is 
the scale from the left is assigned 6 points; second, 5 points; third, 4 points; 


fourth, 3 points; fifth, 2 points; sixth, 1 point; and seventh, —1 point. The | 


total mentions of any one national group under all the seven items are divided 


by the sum total of these scale points multiplied by the number of ethnic — 


groups (20 x 12 = 240) to obtain the Affinity Ratio for a group. 

Separate ratios for the male and the female respondents are given in Table 
2. A highly significant correlation (tho = .714) between the two sexes indi 
cates the close similarity of their attitude toward associating with variout 
ethnic groups. The low figures indicate the infrequency with which a group 
is mentioned by the respondents, which reflects the lack of affinity for it. The 
negative sign reflects the higher mention of the group under the seventh item 
a compared to its total mention under the first six items. 


"Tables 3 and 4, respectively, present the combined and separate responses — 


of the two sexes about the 12 groups when these arc classified as Muslim с 
Non-Muslim and as Asian vy. Non-Asian groups. Chi square values have been 
obtained to test the significance of the associations with Muslim and Asian 
groups as against Non-Muslim and Non-Asian groups. 

In Table 3, chi square values about three types of associations—namcly, 
room partner (x? = 10.00,  < .01) ; resident in the same street (x? = 10.09, 
# < 01), and would not accept (x? = 60.05, p < .001)—are statistically 
significant, showing the greater preference for Muslim groups in these three 
degrees of residential associations as against Non-Muslim groups. These results 
seem to indicate that the present respondents prefer Muslim groups for rela- 
tively permanent and closer associations, but that they would not mind living 
with Non-Muslim groups when the nature of the residence is temporary—for 
example, living in the same hostel (dormitory) and studying in the same 
college—or when the residence is spatially not too close—for example, being 
in the same city or country with members of Non-Muslim groups. Further, 
it also is indicated by the results that, given a choice, subjects would not accept 
members of Non-Muslim groups at all. 

Table 4 is indicative of a high degree of ethnocentrism in so far as all the 
chi square values are highly significant. It appears to be Asia for Asians in 
a broader global perspective. The chi square values for all the seven items 
from left to right are 20.52, p < .001; 20.09, р < .001; 10.61, p < 0l; 
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20.33, p < 001; 10.16, p < 01; 10.31, p < O1; and 7.16, p < 01. This 
ledicates that the respondents prefer Asians to Non-Asians any time and in 
all types of associations, and that they tend not to want to accept Non: Asians 
{а any capacity. The female respondents, as indicated in Table 4, are bess will- 
ing to accept Non-Asians than are their male counterparts. ‘This attitude of 
the respondents is in sharp contrast with those of a study of social distance 
in the Philippines (1), in which the students showed a preference for persons 
who were assimilated to Western culture and who leaned toward the Caucioid 
in physical appearance. The author himself admits that such a result is in 
direct conflict with the subjects’ geographical location and in sharp contrast 
to the fact that the nearby areas are predominantly Muslim, Buddhist, of 
Hindu. 


D. CowcLUSIONS 


The following tentative conclusions may be drawn from the results of the 
present study: (а) ‘The male and the female respondents have similar attitudes 
toward associating with members of other ethnic groups. (b) The majority 
of the respondents prefer residential association with Pakistanis. The Germans, 
however, are the most preferred group outside their own. (с) A good part 
of the respondents do not want any type of association with the Russians, 
Indians, Americans, or the British. (4) Muslims as a group are highly pre- 
ferred as residential associates. Non-Muslims, however, may be accepted only 
in temporary and not too close associations. (е) As a large group, Asian 
national groups are given preference to all Non-Asian groups for all types 
ОЁ residential association. 

It may, however, be mentioned in conclusion that the present data were 
collected about two years ago. Due to a number of intervening political and 
*conomic changes in the international scene, it is quite likely that some of the 
negative attitudes, especially those against Russians, may have undergone con- 
siderable change in the positive direction; while some, especially toward the 
Indians, may have become more intensely negative (4). Further, it is always 

{ one group will actually behave 
toward members of another group in accordance with their attitudes as ex- 
pressed in interviews or in response to questionnaires. La Piere's now famous 
Study (3) showed that one's actions do not always conform to one s attitudes. 


E. SuMMARY 


A study with an instrument patterned after the Bogardus Social Distance 
le was conducted to find out the residential preferences of 217 male and 
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female students of the University of Karachi in relation to 12 ethnic 
The results indicate the high preference of the respondents for Muslims 
Asians as against Non-Muslim and Non-Asian groups. The most pre 
Non-Muslim and Non-Asian group are the Germans. 
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ACCURACY OF SOCIAL PERCEPTION OF PHYSICALLY 
HANDICAPPED AND NONHANDICAPPED PERSONS* 


The University of Wisconsin and Indiana University 


Ricuarp Н. IxcweLL, RicHARD W. THoreson, AND STANLEY J. Smits 


A. INTRODUCTION 


The problem of reality of social perception has particular significance for 
the physically handicapped individual in light of evidence of his rejection by 
the nonhandicapped (1). However, the social perceptivity of the physically 
handicapped has been open to question. Arguments, pro and con, on this issue 
are listed by Kelley ef al. (11). Arguments that the physically handicapped 
should be more socially perceptive than their nonhandicapped counterparts 
are (a) physically handicapped persons as a weaker minority have need to 
develop enhanced social perceptiveness in order to manipulate the stronger, 
able-bodied majority; and (5) physically handicapped persons have a great 
need to know their social stimulus value so that they can react in a socially 
appropriate way with the nonhandicapped, and, therefore, make an extra 
effort to be sensitive and to gain maximally in social information-gathering. 
Arguments that physically handicapped persons are less socially perceptive are 
(a) physically handicapped persons find a comparison between themselves and 
normals to be highly unflattering to themselves, and, as such, will cease making 
such comparisons utilizing denial mechanisms and perceptual defense ; and (b) 
the nonhandicapped present such a socially blank facade to the physically 
handicapped that the physically handicapped have little opportunity to learn 
perceptual accuracy. That is to say, the nonhandicapped attempt to spare the 
feelings of the physically handicapped ; in doing so, their behavioral repertoires 
become so constricted that the physically handicapped are given few percep- 
tual cues. у : 

The measurement of social perception, in general, has been subject to inten- 
sive review in recent years (3, 4, 6, 7, 9, 12). It is the authors’ contention 
that the present controversy is caused, in part, bya failure of investigators to 
distinguish between two separate dimensions of empathy: viz., impersonal social 
Perception measuring traits or dimensions which refer to everyone and to no 
one particular dyadic relationship (8) ; and social affectivity which measures 
— 
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the ability to perceive the feeling that one person has for another (15). In 
this study, which looks at person-to-person dyadic relationships, these dimen- 
sions are treated separately. 

The following hypotheses have been tested: (a) The physically handicapped 
are significantly less perceptive than the nonhandicapped in social affectivity 
(it was not anticipated that the null-hypothesis would be rejected for imper- 
sonal social perception). (5) Both groups, physically handicapped and non- 
handicapped, are better able to predict the behavior of persons belonging to 
their own group, rather than a different group, on social affectivity. (c) Both 
groups choose friendship choices more often from the nonhandicapped than 
from the handicapped group (again, it was not expected that the groups would 
differ significantly on impersonal social perception). 


B. METHODOLOGY 
1. Subjects 


The subjects were 12 physically handicapped and 12 nonphysically handi- 
capped University of Wisconsin students. The handicapped and the non- 
handicapped groups were matched for equivalence in age, race, sex, and 
grade-point average. All Ss were white females. All physically handicapped Ss 
had orthopedic disabilities which were clearly visible to fellow subjects and 
to the experimenters. Eleven of the handicapped Ss were disabled by polio- 
myelitis. The other physically handicapped $ was disabled by a spinal curva- 
ture. The types of handicapping ranged from confinement in a wheelchair to 
other orthopedic defects: e.g., atrophy of limbs and concomitant motor dis- 
abilities, distorted postures, etc. None of the handicapped was disabled by 
facial disfigurement or speech defect. 

The mean age of the handicapped group was 20.1 years, range 18 through 
22. The mean age of the nonhandicapped group was 20.3, range 19 through 22. 
The mean grade point for the handicapped group was 2.57, range 1.68 to 
3.4. The mean grade point for the nonhandicapped group was 2.57, range 
1.38 to 3.35. The grade points were computed on a zero-to-four scale. 

Because the ability to predict another’s behavior is based, in part, on the 
amount of social contact between the judge and the judged (14, 10), it was 
necessary to make sure that all Ss were strangers to each other prior to the 
experimental procedure. E interviewed and photographed potential Ss prior 
to the experiment. 


E displayed photographs of previously recruited Ss and asked if the potential 
S knew any of the persons shown. 


All Ss were requested to dress neatly for the experimental procedures, and 
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not to wear slacks or other like apparel which could hide disability: ¢.9., 
braces, atrophy, etc. This instruction also insured uniformity of dress and 
grooming, thereby preventing artificially inflated or depressed scores for 
deviantly dressed or groomed $5. 


2. Experimental Procedures 


The Ss were divided into three groups of eight Ss each. Each group con- 
sisted of four handicapped and four nonhandicapped Ss. Ё and his assistants 
met with each group at the prearranged time and place for the experiment. 

The Ss were all strangers prior to the experiment; therefore it was necessary 
for them to interact socially in order that they might have some basis for 
making their social judgments. The Ss were subjected to 90 minutes of formal 
task-oriented social interaction and 30 minutes of informal social interaction. 

The Ss were seated in a tight circle, without a table to reduce the possibility 
of hiding a physical disability and to reduce feelings of social distance between 
the Ss. E addressed the Ss as follows: 


This study is about what happens when people who are unacquainted 
with each other are brought together in a task-oriented situation. We are 
interested in such things as how the group decides on what task to per- 
form, how persistently the group works at its task, what morale factors 
result from such an assignment, and so on. The general task of this group 
will be that of carrying on a discussion. Here are a list of topics which 
you may wish to discuss. 

Mimeographed lists of discussion topics were then distributed to members 
of the group by E’s assistants. The following topics typify ће 16 included 
in the list: (а) Everyone should believe in and practice some religion. (b) A 
person really has very little control over his fate. (c) People with dangerous 
ideas are as bad as criminals and should be put in jail. The items were selected 
from an unpublished mimeographed draft by E. F. Borgatta (2). | 

The subjects were allowed approximately five minutes to examine the list 
of topics; then E addressed the group as follows: 

You may discuss any of the topics listed on the paper. You may discuss 
one of them or you may discuss more of them; you may discuss as many 
or as few as you wish. You may discuss any of the topics as exhaustively 
and as vehemently as you care to. You can carry on the discussion in 
any manner which is suitable to you. You may choose up sides, appoint 
a discussion leader, vote on topics to discuss—you may do anything you 
wish as long as you carry on the discussion. Do you have any questions? 

The discussions were conducted during two 45-minute sessions in a manner 

decided upon by the group. These sessions were interrupted by a 30-minute 


informal social interaction period during which refreshments were served. 
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At the end of the second 45-minute discussion period, experimental data 
were collected. The Ss were asked to move their chairs back from the circle 
to insure privacy during the data gathering. The first instrument administered 
was a social-desirability scale, The Positive Malingering Scale (5). This scale 
was used to assess impact of social-desirability on a S’s tendency to pick friend- 
ship choices from the opposite group. For example, a subject wishing to present 
herself in a good light might select too many from the opposite group to 
demonstrate her lack of prejudice. Subjects’ tendencies to pick from the oppo- 
site group were correlated with scores on the social-desirability scale. For 
the handicapped group, the relationship was Rho = —.36, not significant. 
For the nonhandicapped group, the relationship was Rho = .11, not signifi- 
cant. Thus, it could not be concluded that the Ss in this study responded in 
a way calculated to please the investigators, 

Next, the Ss were asked to rate all the Ss on The Impersonal Social Per- 
ception Scale (8). This scale has six items. The items are eight-point Likert 
scales. Each of the items measures one of the following dimensions: superior— 
inferior, selfish—unselfish, friendly—unfriendly, good leader—good follower, 
good sense of humor—poor sense of humor, adventurous and ready to try 
anything—cautious and afraid of new situations. The rating sheets were then 
collected and new rating sheets were handed out. For the second set, the Ss 
were asked to try to guess how each of the Ss rated themselves on all the 
above dimensions. 

After the second set of ratings had been filled out and collected, The Social 
Affectivity Scale (15) was administered with the following instructions: 

If you could continue this relationship of the past two hours beyond 
this laboratory, I would like you to list on the paper the person you 
would most like to continue a relationship with, then your second choice, 


and so on, until, in blank seven, you will list the person with whom 
you would least like to continue a relationship. 


Appropriate blanks were handed out by E's assistants. After these blanks 
had been completed and collected, E’s assistants would hand out new blanks, 
and E would then ask the Ss to try to guess where each of their fellow subjects 
ranked them. 

After the data from the Social Affectivity Scale had been gathered, each s 
was taken one at a time to a separate room and asked: “What do you believe 
the experimenters are trying to discover in this experiment?” The purpose of 
this conference was to determine how effectively the true purpose of the 
investigation had been concealed. It was felt that "aware" Ss might tend to 
respond in a socially desirable way, particularly in friendship choicemaking- 
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None of the Ss suggested that accuracy of social perceptions was part of what 
was being investigated. Thus, it was the consensus of E and his assistants that 
the Ss had been effectively deceived by the instructions which E had given them. 

Finally, E and his assistants rank-ordered the handicapped Ss in each group 
as to the visibility of their handicaps. 


3. Statistical Analysis 

The assumptions of interval scaling and normality of distribution could not 
be met for the Impersonal Social Perception and Social Affectivity Scales, A 
frequency distribution of the scores for both the Impersonal Social Perception 
Scale and The Affectivity Scales revealed a marked positive skewing. Thus, 
to determine statistical differences between groups, the Mann-Whitney U test 
for independent samples was used. To determine statistical differences of sets 
of numbers by the same group, the Wilcoxon matched-pairs signed-ranks test 
for related samples was used. Where correlational techniques were appropriate, 
the Spearman rank correlation coefficient was used. Alpha levels were set at 
the .05 level. 

C. RrsuLTS 
1. Hypothesis 1: Differences in Social Affectivity 


In order to test for differences on the Social Affectivity Scale, Ss’ predictions 
were compared with the actual responses, and deviation scores were computed, 
Each $ received a total Social Affectivity score derived by summing the devia- 
tion scores accrued from each subject: e.g., if an 8 guessed that she was first 
on a particular $'s list, and she was actually third, then she would be given a 
deviation score of 2 for that S, etc. 

The Social Affectivity Scale separated the handicapped group from the non- 
handicapped. The handicapped group was significantly poorer than the non- 
handicapped group at predicting behavior by the Social Affectivity Scale (U= 
37, significant at the .025 level, single-tailed test). Hypothesis I is, thus, 
supported with respect to the Social Affectivity measure. 

Group differences on the Impersonal Social Perception Scale were tested 
in the following fashion: each S's predictions for each item were compared 
with the actual response, and deviation scores for each item were computed, 

е deviation scores for a particular dimension for each $ were totaled. Thus, 
each § received a total deviation score for each dimension on the Impersonal 
Social Perception Scale. Total scores for the handicapped and nonhandicapped 
Were grouped; differences in central tendencies for the two groups were tested 
for each dimension separately, and for the total deviation scores from each 


dimension for each S. 
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The Impersonal Social Perception Scale separated the two groups on only 
one dimension: i.e, the nonhandicapped were significantly better than the 
handicapped at predicting behavior on “adventurous and ready to try any- 
thing—cautious and afraid of new situations” dimension (U = 33, significant 
at the .025 level, single-tailed test). The total Impersonal Social Perception 
score, the totaling of all the dimensions for each $, approached significance 
(U = 43, with a U of 42 being significant at the .05 level for a single-tailed 
test). The null-hypothesis could not be rejected with respect to the Impersonal 
Social Perception Scale. 


2. Hypothesis II: Differences in Predicting Behavior 


Hypothesis II was supported by the Wilcoxon matched pairs signed-ranks 
test of differences between the handicapped and nonhandicapped groups. On 
the Social Affectivity Scale the nonhandicapped were found superior at guess- 
ing the behaviors of members of their own group as opposed to guessing the 
behaviors of the handicapped group (N = 12, t = 7, significant at the .005 
level, single-tailed test). The handicapped also were better able to predict the 
behaviors of the nonhandicapped group as compared to persons belonging to 
their own group (2 = 13.5, N = 12, significant at the .05 level, two-tailed 
test). Thus, Hypothesis II is supported with respect to the behaviors of the 
nonhandicapped group using the Social Affectivity Scale. Hypothesis II is not 
supported with respect to the handicapped group on the Social Affectivity 
Scale. In fact, the results suggested a trend for the handicapped to predict 
the behavior of nonhandicapped persons with greater accuracy than the 
behavior of handicapped persons. Again the null-hypothesis could not be 
rejected with respect to the Impersonal Social Perception Scale. 


3. Hypothesis III: Differences in Friendship Choices 


The Social Affectivity Scale was used exclusively in determining friendship 
choices, For each S, three sociometric scores were computed: an overall 
sociometric score, where the ratings of all Ss, handicapped or nonhandicapped, 
are included; a nonhandicapped sociometric score, where the handicapped and 
nonhandicapped groups independently give mean sociometric ratings to mem- 
bers of the nonhandicapped group; and a handicapped sociometric score, where 
the handicapped and nonhandicapped groups independently give mean socio- 
metric ratings to members of the handicapped group. Sociometric scores were 
computed by assigning a numerical value to an 8 according to the position he 
occupied on any particular list. For example, if an § was chosen as first choice, 
she was given a numerical score of 7; second place 6; etc. 
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The overall sociometric scores of the nonhandicapped were significantly 
higher than for the handicapped group (U = 29.5, significant at the .01 level). 

In comparing the friendship preferences within groups and between groups, 
it was necessary to compute a mean sociometric score for each $, since for 
any single rating $ could rate only three from her own group, whereas she 
would be able to rate four from the opposite group. Thus, the mean ratings 
of each handicapped were compared to the mean ratings of cach nonhandi- 
capped by means of the Wilcoxon matched-pairs signed-ranks test. The non- 
handicapped preferred the friendship choices from their own group over- 
whelmingly (N — 11, ż = 4, significant at the .005 level). The handicapped 
also shared preference for friendship choices from the nonhandicapped group, 
although not significantly so. Eight of the 12 handicapped Ss gave higher 
sociometric scores to the nonhandicapped than to the handicapped. 

Hypothesis III would seem to be partially confirmed. The nonhandicapped 
displayed a statistically significant preference for nonhandicapped friends. The 
handicapped also preferred nonhandicapped friends, but this preference was 
hot statistically significant. 

No relationship was found between subjects’ grade-point average (GPA) 
and sociometric score, Rho = —.09, or Social Affectivity, Rho = —.04. 
The relationship between GPA and Impersonal Social Perception was Rho = 
—.41 (significant at the .05 level). The relationship between visibility of 
handicap and sociometric choice (interjudge reliability: .86, 1.00, and 1.00 
for groups I, II, and III) and sociometric choice ranged from .40 to —.86, 
mean Rho = —.22 (not significant). The correlation between the Impersonal 
Social Perception Scale and Social Affectivity Scale was Rho = 0.56 (signifi- 
cant at the .01 level, df = 0.56). 


D. Discussion 


The results of this investigation appear to confirm the supposition of Kelley 
et al. (11) that physically handicapped persons are less socially sensitive than 
are nonhandicapped persons. With regard to Hypotheses I and II, the data 
would suggest that the physically handicapped were significantly poorer than 
the nonhandicapped in social perception, and that both the nonhandicapped 
and handicapped had significantly more difficulty in attempting to predict the 
behavior of handicapped subjects than that of nonhandicapped subjects. 

The results of Hypothesis IJI lend support to the concepts of minority 
Status of the handicapped and rejection of the handicapped by the nonhandi- 
Capped as suggested by Wright (16) and Smits (13). Rejection of self by 


114 JOURNAL OF SOCIAL PSYCHOLOGY 


the handicapped was hinted at by the finding that eight of 12 handicapped 
subjects expressed preference for nonhandicapped friends. 

The results suggest that Social Affectivity and Impersonal Social Percep- 
tion, although significantly related, do measure different things. Results for 
both Hypotheses I and II suggest that in person-to-person dyadic relationships 
the Tagiuri method provides differences in empathic ability as predicted, while 
in the case of the Impersonal Social Perception Scale such differences are 
not found. 

To account further for the general finding of differences in efficacy between 
the Dymond and Tagiuri method of measuring empathy, the authors took the 
Gage and Cronbach position (9) that a global disposition of the judge may 
account for much of the variance in accuracy scores. It was conjectured that 
one such global disposition by the subjects of the study could be self-concept. 
Specifically, it was hypothesized that the handicapped would exhibit signifi- 
cantly lower self-concept levels than would the nonhandicapped, and that 
the lower self-concept level would act as a substantial deterrent to an accurate 
assessment of the cold hard “facts of life” by peers. To test this hypothesis, 
self-concept scores were computed for all subjects by assigning values to the 
positions that the Ss guessed that they occupied in the other Ss’ list. Thus, if 
an § guessed that she was first on a certain list, she was given a 7; if she 
guessed that she was third on another list, she was given a 5; if she guessed 
that she was given last place on still another list, she was given 1, etc. The 
total of all these scores was a self-concept score for an S. 

Differences between the self-concepts of the handicapped group and the non- 
handicapped group were compared by the Mann-Whitney U. The handicapped 
group was found to have significantly poorer self-concepts than did the non- 
handicapped group (U = 40.5, significant at the .05 level, single-tailed test)- 
Additional support is offered by a count of negative and positive errors which 
revealed that the handicapped had the greater number of errors in the direc- 
tion of positive errors, and the nonhandicapped the greater number of errors 
in the negative direction, although the direction was not significant. 

Results of this study would suggest that the physically handicapped are less 
socially perceptive than a comparable group of nonhandicapped persons. It has 
been suggested by the authors that the basis for this is to be found in the 
inability of the physically handicapped to accept the pervasiveness of their 
rejection. Alternative explanations which were not considered include (4) 
greater use of perceptual defense, (2) anxiety, (c) social deprivation by the 
physically handicapped, (4) a blank social facade presented by nonhandicapped 
to the physically handicapped, or (e) some combination of the above. 
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E. SUMMARY 


is study compared the accuracy of social perception of physically handi- 
tapped and nonhandicapped persons. Social perception was defined as the 
ability to predict another's behavior on the Social Affectivity Scale (a modifi- 
‘cation of the relational analysis of Tagiuri) and on the Impersonal Social 
Perception Scale (a modification of Dymond's Empathy Scale). Results 
that handicapped female college students were significantly poorer at 
ting behavior via the Social Affectivity Scale than were nonhandicapped 
male college students. 
Тһе Impersonal Social Perception Scale failed to separate the groups. Both 
handicapped and nonhandicapped were significantly more accurate at 
edicting the behaviors of nonhandicapped Ss than handicapped Ss on the 
cial Affectivity Scale; the Impersonal Social Perception Scale failed to 
fferentiate between the two groups. Both the handicapped and nonhandi- 
ped expressed greater preference for nonhandicapped than handicapped 
is as friendship choices. 
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CONFORMITY BEHAVIOR AND NEED FOR AFFILIATION* ' 


The Ohio State University and Bucknell University 


PauL E. McGHEE AND RICHARD С, TEEVAN 


A. INTRODUCTION 

Since the early studies of Asch (1, 2), which clearly pointed out the existence 
of consistent individual differences in conformity behavior, an increasing 
amount of attention has been directed toward various "personal" correlates 
of conformity behavior. In light of the social nature of the conformity situa- 
tion, it would seem that need for affiliation (n Aff) might be an important 
variable in determining whether an individual remains independent or yields 
to discrepant group judgments. $ 

Past research relating conformity and n Aff has produced conflicting results. 
Using male high school seniors in a Crutchfield (4) apparatus as Ss, Kaplan 


- (7) found no significant relationship between these variables. Hardy (6), 


using college males as Ss in an Asch-type situation, provided both support and 
Nonsupport conditions and found that high n Aff groups conformed more 
under conditions of nonsupport than under conditions of support. The medium 
п АЁ group was found to conform under both conditions. The low п Aff 
group conformed more under conditions of support than nonsupport; but 
they were considered to be responding more to the objective content of the 
test situation than to its social structure. Samelson (10) used an Asch-type 
Situation and failed to find a significant relationship between n Aff and 
conformity. 

Since the data on the relationship between n Aff and conformity in male 
Ss are conflicting, the present study was designed to investigate intensively 
9ne type of conformity situation, with male Ss. Only perceptual tasks were 
used as measures of conformity so that a situation might be established in 
Which $'s judgment was clearly objectively correct, but differed from that 
of an erring unanimous majority. 

A modified version of the Crutchfield (+) apparatus was used to test the 
hypothesis that high n Aff Ss would conform to a significantly greater degree 

an would low n Aff Ss. 
co—— — 
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B. Метнор 
1. Subjects 


The Ss were 80 male high-school juniors and seniors, All Ss participated 
in the experiment in groups of five; these groups were formed on the basis of 
similarity of class schedules. 


2. Apparatus 


The apparatus consisted of the standard apparatus for the administration 
of the Thematic Apperception Test (TAT), including four TAT slides, 44 
slides consisting of perceptual problems, and a modified version of the Crutch- 
field apparatus. 

The four TAT slides were those used by McClelland et al. (8) to test 
for n Ach; the stories written to these slides were scored for n Aff. A detailed 
description of the structure and functioning of the Crutchfield apparatus may 
be found in Crutchfield (4). 

A distinction was made between three types of slides: filler, bogus, and 
critical slides. The group of filler slides consisted of 22 items in which the 
four responses given by E were objectively correct. The Ss might be in any 
of the five response positions when responding to these slides. The group of 
bogus slides consisted of 12 items in which some of the responses registered by 
E were correct and some incorrect. The Ss might be in any one of the five 
positions when responding to these slides. Neither the filler nor the bogus 
slides were used as measures of conformity. The critical items consisted of 
10 items in which E consistently registered the wrong response on $'s panel. 
The Ss were always last when responding to these slides. The 10 critical items 
were interspersed throughout the filler and bogus slides, so as not to агоиѕе 
S's suspicion. All 44 slides were pretested, and only those slides that were 
answered correctly by the pretest Ss were used as measures of conformity. 
Since the conformity score was based on the number of agreements with the 
incorrect responses registered by E on the critical items, the highest possible 
conformity score was 10. 

All 44 slides involved making such judgments as the relative lengths of 
lines, sizes of squares and circles, the determination of parallel or perpendicular 
lines, and so forth. 


3. Procedure 
Prior to testing for conformity, the TAT was administered to all Ss accord- 


ing to standard neutral instructions (8, p. 186). All four stories were scored 
for n Aff using the method devised by Atkinson, Heyns, and Verof (3): 
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| g reliability with the manual provided by the authors was .95 [computed 
by the method described by McClelland et al. (8, р. 186)]. 

In the conformity experiment, Ss were brought into the experimental room 
five at a time and seated in one of the booths. Before reading the instructions, 
Е took a few minutes to explain that the university was interested in the rela- 
t accuracy of high-school juniors and seniors in making perceptual judg- 
ments and solving perceptual problems. The E then read to the Ss instructions 
thich explained the functioning of the apparatus, and the nature of the Ss’ 
Since the instructions were complex, extra time was taken to clarify any 
tions Ss might have; in some cases portions of the instructions were 
repeated. 2) 

While the instructions indicated that each of the five Ss would respond in 
Succession, the situation was actually contrived so that all Ss responded at the 
same time, while E gave the answers which $ supposedly thought other Ss to 
be giving. With each successive slide, Ss were placed in a different response 
sition. 
C. RrsuLTS 
Тһе n Aff scores of all Ss were separated into two groups, depending on 
whether their score was above or below the median score. "The means for the 
Above and Below groups were, respectively, 4.30 and 3.57, the difference 
veen which is significant (t-test, one-tail, p < .005) [see Edwards (5)]. 
‚ the hypothesis that high п Aff Ss would conform to a significantly 
т degree than low п Aff Ss was confirmed. 


D. Discussion 


While the present study indicates a positive relationship between n Aff and 
conformity, it must be noted that the current data may be culture-bound, 
college students—a very restricted sample of mankind—were used. This 
ibility appears likely, in view of the findings of Milgram (9), who demon- 
ted national cultural differences in conformity in Norwegian and French 
ере students. у 
Since the present experiment placed each individual against a unanimous 
Majority, the results support the finding of Hardy (6) that high n Aff Ss 
Onformed under conditions of nonsupport. They contradict the findings of 
Kaplan (7) and Samelson (10), who found no relationship between n Aff 
conformity, Thus, the question automatically arises as to the reason for 
discrepant results. en | 
‘Since Kaplan (7) also employed a Crutchfield (4) apparatus in his experi- 
Ment, the differential nature of the conformity situation does not appear to 
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be a critical variable. However, a possible explanation of these opposing results 
lies in the degree of arousal of the affiliation motive in the conformity situa- 
tion itself. In conjunction with this variable, it is likely that the affiliational 
relationship among Ss in specific groups is of critical importance. A high n Aff 
person might consistently remain independent in the face of opposing group 
judgments if he did not know the other Ss, or if he disliked them. A low 
п Aff person, on the other hand, might have this affiliational need aroused if 
he is in a group of friends, or persons he admires and would like to have as 
friends. The only study to date which has taken the affiliational relationship 
between participating Ss into consideration is that of Kaplan (7); however, 
Kaplan failed to relate the п Aff motive to conformity behavior within his 
groups of affiliates, neutrals, and hostiles (groups were formed by means of 
а sociogram ). 

If it can be assumed that the degree of arousal of n Aff varies as a function 
of §’s feelings about, and affiliational relationships with, the constituent mem- 
bers of the group, then it may be that uncontrolled differences with respect 
to this factor are responsible for the conflicting data. It would be of value, 
then, for future research with these variables to take this relationship into 
consideration. 

E. SUMMARY 


Eighty male high-school juniors and seniors participated in a conformity 
experiment, using a modified version of the Crutchfield (4) apparatus. The 
investigation studied the relationship between conformity behavior and affilia- 
tive motivation. The Ss, of varying n Aff, were subjected to a simulated 
group pressure situation in which each § was asked for his judgment of а 
given perceptual problem in the face of four earlier erring, but unanimous, 
judgments. A conformity response was defined as yielding to this erring 
majority on those slides which all Ss in a pretest were able to answer correctly. 
The results indicated that high n Aff Ss conform significantly more than low 
n Aff Ss. The probable culture-bound nature of these findings was noted, along 
with a possible explanation of the conflicting findings on the relationship 
between n Aff and conformity. 
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THE VOLUNTEER SUBJECT IN SENSORY RESTRICTION 
RESEARCH* 


The University of North Carolina at Charlotte 


Duane P. Scnutrz' 


A possible limitation to the generalizability of certain research findings using 
human Ss is the restriction of the sample to volunteers. The rapidly develop- 
ing area of sensory and perceptual deprivation seems to rely totally on volunteer 
Ss. Even though these Ss are usually compensated for their period of confine- 
ment, not all potential Ss choose to volunteer (2). The present paper discusses 
two attempts to determine differences in personality characteristics between 
volunteers and nonvolunteers for sensory deprivation. 

Astudy by Myers (1) reported that 73 per cent of a large group of enlisted 
United States Army personnel volunteered to participate in 96 hours of sensory 
deprivation for which they were offered no monetary rewards. The two most 
frequently cited reasons for volunteering were (a) to contribute to a scientific 
effort, and (b) to see how they would react to a stressful situation. Some of 
the reasons for not volunteering were (a) being afraid that they could not 
last the full time, (5) dislike of being in the dark for a long period, and (с) 
being too restless to do nothing for 96 hours. The results of a large battery 
of tests and a biographical inventory revealed that volunteers were lower in 
Depression and Psychopathic Deviancy on the MMPI than were nonvolun- 
teers. There were also nonsignificant trends toward lower scores among 
Volunteers in Hysteria and Psychasthenia on the MMPI. Myers concluded 
that the volunteer $ is a sounder and more stable personality than is the 
nonvolunteer, d 

"The present study utilized 59 female college sophomores in introductory 
Psychology classes at Mary Washington College, the women's branch of The 
University of Virginia. All Ss took the Cattell 16-PF Test under conditions 
9f complete anonymity using a coded number system with each $'з number 

ing known only to herself. This was done in an effort to secure as valid 
test responses as possible. One month later, Ss were told that a series of studies 
Оп sensory deprivation was going to be undertaken at the college. The condi- 
tions of confinement (darkness, silence, remaining quiet and still, etc.) were 
>... 
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explained to them and Ss were asked to volunteer to serve for three hours in 
isolation. They were also told they would be paid at the rate of $1.00 an 
hour. Ss indicated in writing whether or not they would volunteer. | 

Of the 59 Ss, 81 per cent volunteered, while 19 per cent did not. These 
figures are not too dissimilar from Myers’ 73 per cent volunteer rate if one 
considers the shorter confinement period and the inducement of compensation 
in the present study. 

The majority of the volunteers gave as their reason curiosity about isolation 
and about their own reactions in that situation, Additional reasons offered 
were wanting to participate in scientific research, liking a new experience, 
and one $ even said she thought it would be fun. Only three Ss said they 
volunteered “for the money.” The majority reason given for not volunteering 
was the expressed feeling of “not being able to stand it.” 

None of the volunteer-nonvolunteer differences on the Cattell Test proved 
to be significant. When a lower level of confidence (.10) is taken, however, 
two interesting trends appear. Volunteers scored higher on Factors C and H 
than did nonvolunteers. The higher score on both factors suggests that the 
volunteers are more emotionally mature and stable, and more adventurous, 
than are the nonvolunteers. 

Although these findings did not reach statistical significance, their agret- 
ment with the results of Myers suggests fruitful leads for further research. 
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CTORS INTERACTING WITH BIRTH ORDER IN SELF- 
SELECTION AMONG VOLUNTEER SUBJECTS* 


The City College of The City University of New York 


BARBARA SNELL DoHRENWEND, SoL FELDSTEIN, JOYCE 
PLoskv, AND GERTRUDE К. SCHMEIDLER 


OA. INTRODUCTION 


tudies of the relation between birth order and volunteering for psychologi- 
il experiments have yielded conflicting results. While Capra and Dittes (1), 
dfeld (7), and Varela (8) all showed a disproportionate rate of volunteer- 

by firstborns, Ward (9) reported no difference between first- and later- 
огпз. Reanalysis of Ward's data in the light of Schachter’s findings concern- 
g the relation between birth order and affiliative motives (5) reveals, 
er, that there was a significant tendency for firstborns, when not under 
ure to volunteer, to avoid an experiment that would isolate them socially.! 
result, together with those of Capra and Dittes, and Varela, showing 
oportionate volunteering for group discussion experiments by firstborns, 
istently suggests that volunteering is related to the strong affiliative needs 
lürstborns, However, this interpretation was questioned by Suedfeld on 
he basis of his finding that firstborns volunteered disproportionately for a 
periment. A further problem concern- 
the interpretation of these studies is raised when we note that, while 
achter showed that affiliative motives are aroused in firstborns only when 
are anxious (5, pp. 46 and 100), there is no evidence presented by Capra 
Dittes, Varela, or Suedfeld that their subjects were anxious at the time 
V were asked to volunteer. Thus, there is no reason to assume that their 
vior was related to affliative motivation. Only Ward's subjects, students 
the first session of an elementary psychology course, were probably some- 
What anxious. Ў 
"Two related hypotheses are suggested by the authors’ analysis of these four 
ts: (а) when pressure to participate in an experiment is not high and 
lety is aroused, firstborns are more likely than later-borns to avoid a socially 


у isolating sensory deprivation ех 


Received in the Editorial Office, Provincetown, Massachusetts, on March 9, 1966. 


yright, 1967. ] Press. 4 , 
Table 1 M des doit dew that among subjects recruited for the “Alone” 
ment before they were informed of a course requirement to volunteer at some 
during the semester, 32 per cent of firstborns and 55 per cent of later-borns 
nteered (x2 = 4.95; with 1 df, Ё < 405). 


125 


126 JOURNAL ОР SOCIAL PSYCHOLOGY 


isolating experiment; and (4) firstborns are more strongly attracted than later- 
borns to the role of experimental subject regardless of the content of the study, 


B. PROCEDURE 


In the present study, volunteers for a sensory deprivation study were 
solicited with an announcement posted on bulletin boards at The City College. 
This announcement included statements designed to arouse anxiety, such as, 
“Many people have found the sensory deprivation experience an unpleasant 
one.” After a volunteer had responded to the posted notice and had been 
accepted as a subject, he was told that he was to be paid $50. This large 
sum served to convince him further that he would be undertaking a difficult 
and unpleasant task. 

In an interview the day before the deprivation session, subjects were ques- 
tioned about whether they chronically experienced 22 psychological symptoms 
which Midtown researchers reported discriminated at the .01 level or better 
between a group of patients and persons rated "well" on the basis of a half- 
hour psychiatric interview (4), Immediately following the eight-hour depriva- 
tion session, before the subjects left the chamber, they were asked about 20 
of these 22 items with reference to whether they experienced them during 
deprivation. On the basis of evidence that symptoms, such as the ones included 
in this battery, indicate reactions to immediate stressors (3), these 20 items 
were used in the present study to measure emotional disturbance experienced 
during sensory deprivation, relative to levels "experienced in more usual 
situations of daily living.” 


C. RESULTS 


Since the authors endeavored to arouse anxiety in connection with their 
recruitment for a socially isolating sensory deprivation experiment, the number 
of firstborn volunteers should have been relatively low. Comparison with 
Suedfeld’s (7) results indicates that this was the case. While 23 of his 29 
subjects were firstborn, of the authors’ 16 subjects only seven were firstborn. 
This difference yields а x? of 4.38 which, with one degree of freedom, has 
a probability of less than .05. 

Concerning the motivation of the firstborns who did volunteer, one possi- 
bility is that they are individuals who, though firstborn, are able to cope with 
anxiety in social isolation. The other possibility is that they conform to the 
dominant pattern among firstborns and arc, as Wrightsman has shown (10), 
not able to reduce their anxiety when socially isolated. The latter finding 
would imply a motivation to participate in the experiment strong enough to 
overcome reluctance based on the threat of social isolation when anxious. 


BARBARA SNELL DOHRENWEND, ET AL. 127 


Table | shows that, among subjects whose level of symptoms changed from 
"usual level during the deprivation experience, firstborns were significantly 
likely than later-borns to show an increase. The authors infer, therefore, 
at firstborns experienced more anxiety than later-borns in the sensory depriva- 
on situation, indicating that it was an aversive situation for the firstborns 
вре their having chosen to participate. 


ч ТАВГЕ 1 
| NuMBrR or PSYCHOLOGICAL SYMPTOMS EXPERIENCED BY First- AND LaTER-BORN 
JECTS DURING SENSORY DEPRIVATION COMPARED TO USUAL NUMBER OF SYMPTOMS 


Number of symptoms Number of Number of 
during sensory deprivation firstborn later-born 
relative to usual number subjects subjects 
Higher 4 2 
Same 3 2 
Lower 5 
p «09 


D. Discussion 


results of the present study indicate that the affiliative motives of first- 
tend to make them avoid a socially isolating experiment when anxiety 
oused. The authors also find support for the hypothesis that firstborns 
strongly attracted to the role of experimental subject regardless of the 
tent of the study. The authors’ lack of understanding of the motivation 
derlying this attraction raises am issue concerning the effects that volunteer 
ects may have on the outcomes of experiments. Of particular concern is 
* possibility that volunteers are seeking to affiliate with the experimenter, 
Dittes (2) and Staples and Walters (6) have both demonstrated that 
ns whose dominant motivation is affiliation tend to accept suggestions 
those with whom they seck to affiliate. If motivated in this way, volunteer 
ects would tend to facilitate experimenter bias effects by dint of their readi- 
to be influenced by the experimenter. On the other hand, some other 
otive not associated with suggestibility, such as curiosity, may be central to 
nteering. As long as we do not understand why subjects volunteer, how- 
, We cannot even guess to what extent their motivation amplifies or 


pens experimenter bias effects. 
4 E. SUMMARY 


Analysis of previous studies of the relation between birth order and volun- 
ng for psychological experiments suggests two related hypotheses: (a) 
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when pressure to participate in an experiment is not high and anxiety is 
aroused, firstborns are more likely than later-borns to avoid a socially isolating 
experiment; and (5) firstborns are more strongly attracted than later-borns 
to the role of experimental subject regardless of the content of the study. 
Evidence is presented in support of both hypotheses. Confirmation of the 
latter raises the issue of whether the motivation of volunteer subjects makes 
them liable to facilitate experimenter bias effects. 


REFERENCES 
1. Carra, P. C, & Drrres J. E. Birth order as a selective factor among volunteer ' 
subjects. J, Abn. & Soc. Psychol., 1962, 64, 302. 
2. D J. E. Birth order and vulnerability to differences in acceptance, Amer. 
Piychol., 1961, 16, 358 (Abstract). 
3. DowxzwwrNo, B. S, & Donrenwenv, B. P. Stress situations, birth order, and 
Psychological symptoms, J. Abn. Psychol., 1966, T1, 215-223. 


+. Lawonex, T. S. A twenty-two item screening score of psychiatric symptoms indicat- 
ing impairment. J. Health & Hum. Behav., 1962, 3, 269-276. 


Scuacuter, S. The Psychology of Affiliation: Experimental Studies of the Sources 
of Gregariousness. Stanford, Calif.: Stanford Univ. Press, 1959. 


s, 

6. STAPLES Р. R, & Wavrexs, R. H. Anxiety, birth order, and susceptibility to social 
influence, J. Abn. & Soc. Psychol., 1961, 62, 716-719. 

7. 


. Sure, P. Birth order of volunteers for sensory deprivation. J. Abn. & Soe. 
Prychol., 1964, 68, 195-196. 


VARELA, J. A. A cross-cultural replication of an experiment involving birth order. 
J. Abn. @ Soc. Psychol., 1964, 69, 456-457. 


9. Waro, C. D. A further examination of birth order as a selective factor among 
volunteer subjects. J. Abn. & Soc. Psychol., 1964, 69, 311-313. 


10. Weicurssan, L. S., Jr. Effects of waiting with others on changes in level of felt 
anxiety. J. Abn. & Soc. Psychol., 1960, 81, 216-222. 


Department of Psychology 

The City College of The City U niversity 
of New York 

New York, New York 10031 


The Journal of Social Psychology, 1967, T2, 129-134. 


PRACTITIONERS’ PERCEPTUAL CONSISTENCY OF MENTAL 
PATIENTS’ BEHAVIORAL CHARACTERISTICS* 


Division of Social Medicine, Montefiore Hospital amd 
Medical Center, Bronx, New York 


IRWIN GERBER 


А. INTRODUCTION 


In recent years a significant development in the scientific study of the mental 
hospital has been a shift from the analysis of the patient to the analysis of 
practitioners who service these patients. In particular, investigations by 
Gerjuoy (1), Lefton (2), Stanton and Schwartz (5), and Wallace (6) have 
explored the problem of how psychiatric practitioners perceive mental*patients’ 
psychotic behavioral patterns. The general argument of these investigations 
is that if practitioners in the mental hospital disagree in their perceptions, this 
discord will hinder the proper servicing of mental patients. It is assumed that 
differences in practitioners’ perceptions will generate behavior on the part of 
practitioners which will have an effect on the proper care and treatment of 
patients, 

If it can be assumed that differences in practitioners’ perceptions of patients’ 
psychotic behavioral patterns influence the type of treatment offered to pa- 
tients, then differences in the consistency of practitioners’ perceptions may also 
Potentially affect the treatment of patients. The general problem of this 
Present report is to explore whether practitioners differ in their consistency of 
Perceptions and to discover some of the relevant variables related to these 
differences, if they do exist. 

A mentally ill person exhibits many types of psychotic behavioral patterns. 
This is not to say that a particular patient will exhibit all types of pathological 
- behavior, but rather the patient will generally exhibit more of one type of 

behavior than another. This factor is crucial for the mental hospital because 
à patient may shift through different patterns of behavior during the course of 
his treatment. One can assume that if certain practitioners are consistent in 
their perceptions of mental patients, these practitioners will consider all pa- 
tients’ behavior as representing either extreme forms of pathology or com- 
Pletely normal behavior. This perceptual consistency does not increase the 
E ——__ 
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patient's chances for recovery, In order for practitioners to treat patients prop- 
erly, they must be aware of differences which exist between various types and 
degrees of mental illness. 


B. MrrHopoLocv 


The data to be used in this study were collected from an investigation 
conducted at the Jacob L. Reiss Psychiatric Pavilion of the St. Vincent's 
Hospital of the City of New York.! The sample consisted of 35 staff members 
(14 psychiatric residents, 10 nurses, nine social caseworkers, and two social 
group workers) of the Pavilion, who rated a total of 48 patients. 

‘The instrument used to measure practitioners’ consistency of perceptions of 
patients’ psychotic behavioral characteristics was the Psychotic Reaction Profile. 
This rating form developed by Lorr, O'Conner, and Stafford (3) measures 
four second-order factors previously found in multiple factor studies of inter- 
view and ward behavior. The Psychotic Reaction Profile was originally de- 
veloped in order to standardize a multidimensional behavioral inventory for 
the description of obse le psychotic behavior in a mental hospital. The four 
independent (second-order) factors of the Psychotic Reaction Profile are as 
follows: 

1. The Withdrawal Scale (W). This is the best balanced and most reliable 
scale with a correlation coefficient of .94. The scale is composed of 38 state- 
ments and is best defined by three elements of patient activity: manifest 
interest in the environment, activity level, and interpersonal responsiveness. 

2. The Thinking Disorder Scale (D). The Thinking Disorder Scale is 
composed of 18 items and has a reliability coefficient of .90. Disorientation, 
irrelevant and incoherent speech, and hallucinations are the major charac- 
teristics of this factor. 

3. The Paranoid Belligerence Scale (B). Paranoid Belligerence is orga- 
nized around 24 statements and has a high reliability coefficient of .90. Pa- 
tients scoring high on this scale are generally hostile, irritable, bossy, and 
paranoid. 

4. The Agitated Depression Scale (D). Agitated Depression is the least 
reliable scale composed of only five statements and has a reliability coefficient 
of .74, The major behavioral patterns are anxiety and depression. 

The Psychotic Reaction Profile was completed by all sampled practitioners 
at the time patients were discharged from the Pavilion. Consistency of pet 
ceptions was measured by correlations (the product moment correlation) 


k 1 The parent study was supported in whole by the Department of Health, Educa- 
tion, and Welfare, Office of Vocational Rehabilitation (Project Number 578). 
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between each of the four independent factors of the Psychotic Reaction Profile. 
Differences between correlations for two different staff groups were com- 
puted by transforming correlation coefficients into standard scores (zs) and 
testing for differences between the standard scores (4). 


C. RrsuLTS 


"Table 1 indicates that for psychiatric residents there are significant correla- 
tions between the Withdrawal and the Paranoid Belligerence patterns, between 
the Withdrawal and the Agitated Depression patterns, and between the 
Thinking Disorder and the Paranoid Belligerence patterns. For nurses all 
correlations are statistically significant, except for correlations between the 
Withdrawal and the Thinking Disorder patterns, and between the Thinking 
Disorder and the Paranoid Belligerence patterns. Table 1 also reveals that for 
social caseworkers and social group workers all psychotic behavioral patterns 
are significantly correlated. On the basis of the above results, one can con- 
jecture that practitioners (psychiatric residents and nurses) who are closer 
to the ward setting are less consistent in their perceptions of patients’ psychotic 
behavioral patterns than practitioners (social rkers and social group 
workers) who primarily service patients off the . However, there still 
remains the need to verify statistically whether different staff groups are differ- 
ent in their perceptual consistency. 


TABLE 1 


CORRELATIONS AND z SCORES OF PERCEIVED PsvcHoric BEHAVIORAL. PATTERNS OF 
PATIENTS AS RATED BY PSYCHIATRIC Reswents, Nurses, SOCIAL 


Case- Өгөр 
Correlated Residents Nurses _ workers wo! ы 
Patterns r * r Е г Е r 
* 62 
W x TD ; 45 32% 33 4° 75 £I B 
W xB pomo. 07 07 "nC я 4 
Ухо ase 49 38% 39 43° 46 ш 7 
TD X B 49% 453 —0s 05 $5* 183 »* 18 
TD X D iU 33. 134 ase 048 6037 
Bx D 20 20 —3@ +37 46. 4 38. 0 


ithdrawal, Thinking Dis- 

* For all tables W, TD, B, and D are symbols for the Wit ‚ Thir ] 
order, | Paranoid Belligerence, and Agitated Depression Scales of the Psychotic = 
Profile. Also, an X between two psychotic behavioral patterns represents a Corre ation 
Computation. iah 

* Statistically significant correlation at the .05 level of significance. 
The following set of results pertains to differences in correlations (as 
Tepresented by z score differences) of perceived psychotic behavioral patterns 
of patients as rated by psychiatric residents, nurses, social caseworkers, and 
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social group workers. The direction of differences between staff groups is 
indicated by the correlations in Table 1. 


Table 2 indicates that psychiatric residents significantly differ from nurses 
for the correlation between the Thinking Disorder and the Paranoid Belliger- 
ence patterns. The difference is that residents show a higher correlation be- 
tween these two patterns than do nurses. Table 2 also reveals that residents 
significantly differ from caseworkers for correlations between the Withdrawal 
and the Thinking Disorder patterns, between the Withdrawal and the Para- 
noid Belligerence patterns, between the Thinking Disorder and the Paranoid 
Belligerence patterns, and between the Thinking Disorder and the Agitated 
Depression patterns. The direction of differences for all significant results 
indicates that residents have lower correlations than do social caseworkers. 
For the comparison between residents and social group workers there are 
significant differences for correlations between the Withdrawal and the 
Thinking Disorder patterns, and between the Thinking Disorder and the 
Paranoid Belligerence patterns, The direction of differences indicates that 
residents have lower correlations than do social group workers. 


ь; ТАВГЕ 2 
DIFFERENCES (z DIFFERENCES) IN CORRELATIONS OF PERCEIVED PSYCHOTIC BEHAVIORAL 
PATTERNS OF PATIENTS as RATED BY PsYCHIATRIC RrsmDENTS, Nurses, 
SOCIAL CASEWORKERS, AND SOCIAL Group WORKERS* 


Case- 
Residents — Residents Nurses Nurses workers 
Residents and and and and and 
and case- group case- group group 
nurses workers workers workers workers workers 
Correlated * z * x * z 
patterns differences differences differences differences differences differences 
W x TD —.18 — .60* — 47° — 42* .29* 12 
WXB 22 5да — 31 — .75* — .53* 22 
WxD 10 03 — 08 — 07 — 18 —41 
TD x B 43% —1.30* —1.29* —1.78* —1.78* 00 
TD X D —33 — 47° — 36 — 14 — .03 ll 
вхр —.17 — .29 20 — 43 — .04 .09 


* See Table 1 for an explanation of all abbreviations. 
* Statistically significant difference for two-tailed z test. 


Nurses significantly differ from social caseworkers for correlations between 
the Withdrawal and the Thinking Disorder patterns, between the With- 
drawal and the Paranoid Belligerence patterns, and between the Thinking 
Disorder and the Paranoid Belligerence patterns, The direction of differences 
indicates that nurses have lower correlations between these patterns than do 
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workers. Table 2 also indicates that nurses significantly differ from social 
foup workers for correlations between the Withdrawal and the Paranoid 
rence patterns, and between the Thinking Disorder and the Paranoid 
rence patterns. As was the case for differences between caseworkers, 
have lower correlations than do social group workers, 


D. Discussion 


l the basis of the findings, it can be concluded that practitioners who play 
foles which are closer to the ward setting are less consistent in their perceptions 
Of patients’ psychotic behavior than practitioners who primarily service patients 
off the ward. 

For differences in correlations, it was noted that both residents and nurses 
liffered from social caseworkers and social group workers, The significance of 
ese differences is that residents and nurses showed lower correlations between 
dependent behavioral variables than did caseworkers and group workers. The 
ect of practitioners’ role-positions in relation to the ward setting on differ- 
in the consistency of perception is further supported by the result that 
es differed from residents in consistency, while caseworkers did not differ 
m group workers. Although residents were closer to the ward setting than 
orkers and group workers, residents were not as close to the ward as 
nurses. Therefore, while residents’ role obligations seem to make them 
aware of differences between patients’ psychotic behavioral patterns 
n caseworkers and group workers, residents seem to differentiate less be- 

these independent variables than do nurses. Furthermore, caseworkers 
group workers who both function away from the ward setting did not seem 
differ in correlations between psychotic behavioral variables, but did differ 
m those practitioners (residents and nurses) who were closer to the ward 
- It can be conjectured that differences in practitioners’ consistency of per- 
ns of patients’ psychotic behavioral patterns have major implications for 
therapeutic process. If one set of practitioners are consistent їп their per- 
ns of patients’ psychotic behavior, while a second set of practitioners 
rentiate between different types of psychotic behavior, then it can be 
ed that these practitioners will differ in their approaches toward patients. 
mental illness is defined by many different types of overt behavioral 
terns, practitioners cannot assume that a patient who exhibits agitated be- 
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havior will also be depressed. A practitioner who generalizes all behavior of 
patients as being pathological cannot be fair in his treatment of patients. This 
is equally true when a practitioner is consistently lenient in his judgment of 
mental patients. 


E. Summary 


Fourteen psychiatric residents, 10 nurses, nine social caseworkers, and two 
social group workers rated 48 psychiatric patients on the Psychotic Reaction 
Profile. The findings indicate that different practitioners from various pro- 
fessional services differ in their consistency of perceptions of patients’ psychotic 
behavioral patterns. Those practitioners (psychiatric residents and nurses) who 
were clinically closer to the ward setting were able to differentiate more 
between different psychotic variables than practitioners who were farther away 
from the ward (caseworkers and group workers). 

The findings of this study were seen as having potential significance for the 
functioning of the mental hospital. Conflicts engendered between practitioners 
as to how patients should be approached, produced by their differing perceptions 
of the consistency of patients’ conditions, might prove disfunctional for the 
total organization. 
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A, INTRODUCTION 


In a number of studies, both in England and in the United States, it has 
been found that severity of moral judgment is associated with various indices 
of religious belief and practice (1, 2, 3, 4, 5, 6, 8, 9). Most of these studies 
lave focussed upon various forms of sexual behavior, but they also provide 
tvidence that the tendency for believers, as compared with unbelievers, to pass 
Stricter judgments is to some extent general. Among English samples, believers 
have been found to be more likely than unbelievers to condemn litter dropping 
and tax evasion, and to say that they are worried about exceeding the speed 
limit. 

Middleton and Putney (7), however, offer grounds for thinking that the 
relationship between moral judgment and religion is not entirely general, but 
depends upon the nature of the behavior being judged. They distinguish be- 
tween “ascetic” or "private" morality, where the consequences of actions are 
- felt mainly by the wrongdoer, and “social” morality, where the consequences 
_аге felt in the first instance by others. The authors argue that, since social 
Morality is a function of living in communities, it will be common to believers 
and unbelievers alike, whereas “ascetic” morality derives from religious tra- 
ditions and will be less strongly supported by unbelievers than by believers. 
They do, of course, recognize fully the tentative nature of the distinction. The 
_ amples of behavior falling within the concern of ascetic morality which they 
Used in their study were various forms of sexual activity, gambling, smoking, 
and drinking; social actions were represented by stealing, cheating, and ag- 
_ Bression toward others. With a sample of American students, they found sig- 
nificant relationships between indices of religiosity and the condemnation of 
- ahtiascetic behavior, but no evidence of corresponding relationships for anti- 
Social behavior. 


* Received in the Editorial Office, Provincetown, Massachusetts, on March 27, 1966. 
“pyright, 1967, by The Journal Press. 
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A further result of the Middleton and Putney study, and one which has 
been found elsewhere, was that women tended to adopt a stricter moral tone 
than do men. This was true for both ascetic and social issues, but the differ- 
ence was greater for the former than for the latter. 

The main purposes of the present study are as follows: (a) To examine the 
degree of severity of moral judgment, on a number of behavioral items, of a 
socially defined segment of the population; to test for generality of moral 
judgment over different items; and to consider the relationship of severity of 
moral judgment to such factors as sex difference and experience of coeducation. 
(b) To analyze the relationship between moral judgment and various indices 
of religious belief and practice, and to discover whether this relationship is 
stronger for ascetic than for social aspects of morality. 

The material to be presented forms part of the results of a questionnaire 
survey designed originally to investigate certain aspects of religious education. 
The study was sponsored by the Christian Education Movement, and sup- 
ported by a grant from the Department of Education and Science. Those 
findings which are relevant to religious education will be reported elsewhere. 


B. METHOD 
1. The Sample 


The sample consisted of 2276 pupils, of whom 49.6 per cent were boys and 
50.4 per cent girls, drawn from the sixth forms of 96 maintained grammar 
schools in England, excluding Wales and Scotland. For those unacquainted 
with the English system, this means pupils who are above average in intelli- 
gence, who are mostly pursuing courses of study designed to prepare them for 
various institutions of further education, and whose parents lack either the 
money or the desire to send them to schools that are independent, or semi- 
independent, of the State system. Thirty-five of the schools were coeducational, 
the remainder restricted to one sex. The ages of the pupils ranged from 16 
to just over 18 years, with the majority falling between 17 and 18 years. 

A list of 100 schools, drawn at random from the total number of main- 
tained grammar schools in England, was obtained from the Ministry of Edu- 
cation, and the headmasters and headmistresses invited to cooperate. Sixtetn 
schools declined. Of these, 12 gave fairly full reasons for refusing, all of which 
were of a practical nature and none of which suggested a relevant, systematic 
bias. However, the only two Catholic schools declined to cooperate. A further 
random sample of 16 schools was approached. All were prepared to cooperate 
but the scripts from four arrived too late or were lost in the post. Within 
each school, 25 pupils were chosen from the second year of the sixth form 
according to a predetermined method. Not all schools had 25 in this class. 
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2. The Questionnaire 


he questionnaire contained items on religious belief and practice, the Bible, 


last sections. 

the moral judgment section, subjects were presented with a list of differ- 
t forms of behavior and asked to rate them. The list contained the following: 
ambling, Drunkenness, Smoking, Lying, Stealing, Premarital Sexual Inter- 
, Suicide, and Color Bar. These were chosen because it was found that 
were among the moral issues most discussed by boys and girls of the 
d to be sampled. Subjects were asked to rate cach of them on the following 


A—it is always wrong, 

B—it is usually wrong but is excusable in certain circumstances, 
C— it is usually excusable but is sometimes wrong, 

D—it is never wrong, and 

E—I have not made up my mind. 


A space was provided below each item, and subjects were urged to give the 
ons for their choice. These comments will be analyzed in detail elsewhere ; 
| the present context only occasional reference will be made to them. 
he indices of religiosity used were as follows: 2 five-point rating scales, 
of confidence in the existence of God, and the other of confidence that 
s was the Son of God; a four-step measure of frequency of church at- 
ance, from weekly to never; a three-step measure of private prayer, from 
to never; and denomination. 
е questionnaires were completed in school time and under supervision. 
inonymity was guaranteed. 
C. RrsULTS AND DISCUSSION 

1. General 


е percentages of the total numbers of boys and girls falling in each cate- 
ОЁ response for each moral judgment item are given in Table 1. 


le grounds that it was the only way to sample a variety of different sorts 
ehavior in a comparable fashion and in a short space of time, that the 
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TABLE 1 
THE PERCENTAGES OF THE TOTAL SAMPLE ENDORSING THE DIFFERENT RATING CATEGORIES 
nee PEROHITAOD OF TER Илл SEEMS 
Moral Always Usually Sometimes Never 
issue Sex wrong wrong wrong wrong Undecided 
: Boys 10.7 16.9 $7.0 8.3 7.1 
Gambling Girls 19.4 27.1 37.9 3.5 12.1 
Boys 30.6 46.5 17.9 2.0 3.0 
Drunkenness Girls 4.9 m 73 11 1.8 
b Boys 14.9 10.2 32.4 31.6 10.9 
Smoking Girls 124 8.9 36.2 244 18.2 
Boys 19.1 73.9 5.2 0.5 13 
Lying Girls 23.8 72.8 2.2 0.3 0.9 
: Boys 70.8 27.5 0.6 0.3 0.8 
да Girls 712 273 0.5 01 04 
Жа, Воуз 28.6 27.6 20.5 10.2 13.1 
Intercourse Girls 55.8 25.2 6.6 24 10.0 
F, Boys 32.0 28.7 13.0 15.2 114 
Suicide Girls 334 32.7 10.1 87 151 
Воуз 74.0 16.8 3.2 24 3.6 
Color Bar Girls $53 $9 16 1.5 27 


rating categories permitted subjects to give qualified responses, and that both 
in a pilot study and in the main survey it was plain that subjects found the 
operation a meaningful one. It can be added that less than 5 per cent of the 
subjects made comments that were judged to be logically inconsistent with the 
rating category chosen. 

A further caution needed when comparing items for severity of condemna- 
tion is that responses might have been influenced by the connotative associations 
of the labels used. To illustrate, the label “Premarital Sexual Intercourse” 
may have produced fewer negative responses than would have occurred if the 
term "Fornication" had been used. However, it was impossible to make the 
labels equally neutral in evaluative connotation without elaborate circumlocu- 
tion, so instead the simplest and most direct terms were selected. 

As can be seen from Table 1, the item which received the most uncom- 
promising condemnation was Color Bar, with Stealing a close second. These 
distributions, together with the comments made, suggest that property and 
social justice are important concerns of the subjects. At the other extreme 
Smoking and Gambling received the least condemnation, with nearly a third 
of the boys and a quarter of the girls judging that the former was not really 
a moral issue at all. 

In order to test whether severity of judgment is general or specific to each 
issue, the items were interrelated via the chi square test. For the boys, 23 out 
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the 28 tests were significant (21, p < .001; and two, p < .01), and five 
(сте not significant; for the girls, 22 tests were significant (13, p < .001; 

$< .01; and four, p < .05) and six were not significant. Of the 11 non- 
‘Significant results, eight concerned Color Bar. Among boys, this item is inde- 
pendent of Gambling, Smoking, Stealing, and Premarital Sexual Intercourse, 
- but related to Drunkenness and Lying (p < .01) and Suicide ($ < .001) ; 
"among girls it is independent of Gambling, Drunkenness, Smoking, and Steal- 
‘ing, and only weakly related to Lying, Premarital Sexual Intercourse, and 
Suicide (^ < .05). The remaining nonsignificant results were less systematic. 
Among boys, Suicide and Lying—and among girls, Drunkenness and Lying, 
- and Suicide and Stealing—were unrelated. A possible explanation is that these 
pairs each include a social and an ascetic item. However, other social and 
ascetic items are related, and the two most social items (as rated by independent 
Tudges)—namely, Stealing and Color Bar—are not related to each other for 
either sex. 
| In general, then, there is a strong tendency for those who adopt a strict 
‘moral attitude on one item to do so on the others, with the exception of Color 
Bar. It is not entirely clear why Color Bar should be an exception, but the 
comments written afford one clue. From the remarks of a minority of subjects, 
mit seems that for some subjects Color Bar was conceived in logical rather than 
— moral terms. These subjects stressed the complete irrelevance of skin color, 
‘comparing it with hair color or weight, rather than focussing upon those per- 
sonal consequences of being the object of discrimination which make the issue 
< amoral one. This may reflect the fact that many of the subjects had had no 


ect experience of colored people. 


2. Nonreligious Factors 


© Sex difference. Girls were significantly more severe in their judgments 


than were boys on all issues with the exception of Stealing, where the difference 


жаз in the same direction but not significant, and Smoking, where the differ- 


ence was significant but not in the same direction. On the latter item, girls 
Were more undecided than were boys. 
— b. Age and subject studied. No re 
- Judgment responses and type of subject stu 
Same was true for age, except that among gi $ 
( < .01), with both the younger and the older girls being more condemnatory. 
с. Coeducation. Within each sex, pupils in single-sex schools were compared 
With pupils in coeducational schools to see if there was any association between 


lation could be found between moral 
died at school for either sex. The 
tls, age and Suicide were related 
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experience of coeducation and moral judgment. Among boys, no evidence of 
such an association could be found for any of the moral issues. Among girls, 
judgments were significantly related to type of school on only two issues, 
Smoking (p < .02) and Premarital Sexual Intercourse (р < .05). In each 
case, girls in coeducational schools were less condemnatory than girls in single- 
sex schools. This suggests that coeducation is not an important factor in moral 
judgment, in marked contrast to the relation between religious belief and со- 
education (10). However, if the differences in moral judgment between types 
of school were in the main not significant, such differences as did exist were 
consistently larger for girls than for boys. Contingency coefficients were cal- 
culated, and it was found that seven out of eight were larger for girls than 
for boys (p < .03 on a Sign Test). Moreover, all the differences for the girls 
were in the direction of closer similarity to boys in coeducational schools, 
whereas boys in coeducational schools were more similar to girls on half 
the items only, as compared with boys in single-sex schools. The evidence, 
though slender, supports the finding for religious belief (10) that girls 
in coeducational schools are more likely to be assimilated to the boys than 
the other way round. 


3. Religious Factors 


All moral judgment items were tested for association with the four indices 
of religious belief and practice using chi square. Among boys, all associations 
were significant, and most very highly, with the exception of that between 
belief in God and Smoking. Among girls also, all associations were highly sig- 
nificant except those between Color Bar and belief in God, churchgoing, and 
private prayer, which were not significant; and that between belief in Christ 
and Color Bar, which was only weakly significant (p < .05). In all instances, 
the religious subjects were more likely to adopt the stricter moral position, 
though on one or two items the relationship was not entirely linear. Few de- 
nominational differences could be found, although there was a significant 
tendency for Baptists and members of smaller Christian sects to be more 
strict than others about Gambling and Drunkenness. 

The extent to which the moral items were related to indices of religiosity 
is indicated by Table 2. This table is based only on those subjects who endorsed 
the “always wrong” category. For each moral item, the percentages of those 
in the "always wrong" category who adopt the extreme religious position 0n 
the several indices are given. For comparative purposes, the table also includes 
the percentages of the total sample who fall into the extreme religious cate- 
gories, and the corresponding percentages of those subjects who endorse either 
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ТАВІЕ 2 
Tue PERCENTAGES OF THOSE ENDORSING THE “ALWAYS WRONG” CATEGORY, 
Wuo Fatt IN THE Extreme RiLiGiOUS Posrrions 


Completely pose rmi 
confident of confident Daily 
the existence Jesus was the Weekly private 
of God Son of God churchgoing prayer 
Boys 
37.0 442 59.5 421 
35.8 36.2 46.8 324 
21.9 21.9 357 25.5 
32.7 33.6 414 32.6 
21.9 21.8 32.3 21.6 
ital Sexual 
35.6 35.8 44.8 30.2 
30.1 30.6 40.3 319 
22.0 22.5 32.7 
19.6 20.9 30.1 204 
ts endorsing 
A or B on all items 48.6 50.7 614 43 
Girls 
bling 59.6 63.4 76.7 55.1 
enness 49.3 52.4 541 445 
ing 46.7 47.0 54.5 504 
51.0 50.6 56.9 43.6 
ng 42.0 434 51.6 39.1 
iarital Sexual 
course 50.6 53.1 60.0 47.3 
de 56.2 54.3 62.8 48.0 
Bar 42.8 42.5 51.8 39.1 
sample 39.1 394 49.7 37.7 


cts endorsing 
A or B on all items 61.8 66.7 73.5 59.8 
: “А” means the action is judged to be always wrong, and “В” means the action 


ged to be usually wrong but excusable in certain circumstances. 


o B on every moral item. This last group was very small, consisting of 


il dimension, 1 > 
їп rank order for this dimension using the method of paired comparison. 


tic moral issues were defined as those in which the consequences of behavior 
t primarily by the individual, social moral issues as those in which the 
quences are felt primarily by others. Coefficients of Concordance between 
's were found to be .45 (p < .001) for males and .64 (p < .001) for 
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females. Ranks were averaged and reranked, giving a final rank order for 
each sex. The correlations between asceticism and religiosity were then cal- 
culated for each sex, and are given in Table 3. 


TABLE 3 
CORRELATIONS BETWEEN THE Ascericism OF MORAL ITEMS AND 
THER Association WITH RELIGIOUS BELIEF AND PRACTICE 


(N = 8) 

Boys % Girls % 
Belief in God 51 (n.5.) Belief in God A3 (ns) 
Belief in Christ 36 (n.s.) Belief in Christ .57 (n.8.) 
Churchgoing .53 (n.s.) Churchgoing 45 (n.s.) 
Private prayer 31 (n.s.) Private prayer 76 (.05) 


Although all correlations are in the predicted direction, only one is signifi- 
cant. The item which fitted the prediction least well was Smoking, since it was 
ranked the most ascetic by both male and female judges, and yet was not 
strongly related to religion, especially among boys. This item is, however, to 
some extent atypical, since over 40 per cent of both boys and girls were un- 
decided, or did not think it a moral issue at all. If this item is dropped, five 
out of the eight correlations become significant. 

We may conclude that, in general, these results do offer some support for 
the findings of Middleton and Putney. The more ascetic the moral issue, the 
greater the difference between the responses of religious and nonreligious sub- 
jects. The results differ from those of Middleton and Putney; in that even for 
the most social item, religious subjects tend to be more severe than other subjects. 

Analysis of the comments made by subjects on all items reveals a cleat 
difference between the devout and the nonreligious. The nonreligious tend to 
argue that, if a form of behavior has no undesirable consequences for other 
people, it is not a legitimate matter for the moral evaluation of others; the 
devout claim that the individual's life is not his own, that all his behavior 
concerns God, and that hence there is no part of it which may not be the 
subject of moral evaluation on occasion. However, from the specific reasons 
given by the devout for their moral beliefs, it would seem that these beliefs may 
be serving more than one function. Some devout subjects express considerable 
disgust for antiascetic behavior, and offer as their reasons for condemning 
it the fact that they involve loss of self-control or self-respect, and the giving 
way to “animal” instincts of fear and lust. When such reasons, together with 
appeals to absolute authority, are given in support of an unqualified condem- 
nation, we may suspect that the beliefs are serving a defensive function. On the 


DEREK WRIGHT AND EDWIN COX 143 


т hand, some devout subjects are more concerned with future personal 
development, and see the ascetic moral rules as the necessary condition of 
"growth into self-respect and responsibility. Such subjects, though they condemn 
antiascetic behavior, are likely to qualify their judgments with the recognition 
‘that under certain circumstances such behavior is not detrimental to personal 
integrity. These subjects do draw attention to the fact that religious belief, 
in addition to all its other functions, can serve to support а program of per- 
па! development. 


D. SuMMARY 


A representative sample of 2276 boys and girls, drawn from the second 
т of the sixth forms of state maintained grammar schools in England, com- 
pleted a questionnaire which included items on moral judgment and religious 
belief and practice. The main findings were as follows: 
1. The subjects were most condemnatory of Color Bar and Stealing, and 
t condemnatory of Smoking and Gambling. , 
2. There was a general tendency for responses on one item to be associated 
with responses on the others, with the notable exception of Color Bar, which 
was largely independent. 

3. On nearly all items, girls 
were boys. 

4. Age, subject studied, and expe 
be related to moral judgment. 

5. Consistently, indices of religiosi 
of moral judgment. 

6. The more ascetic the moral issue, 
ious belief and practice. 


were more severe in their judgments than 
rience of coeducation were not found to 
ty were found to be related to severity 


the stronger the association with reli- 
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OCCUPATIONAL PREFERENCES OF TANZANIAN 
SECONDARY SCHOOL PUPILS** 


University College, University of East Africa, Dar ex Salaam 


E. L. KLINGELHOFER 


A. INTRODUCTION 


Many developing countries have come to accept Schultz’s (8) argument 
that education plays a crucial role in economic betterment and consequently 
treat higher professional or technical education as an investment. This view, 
expanded and systematized by Harbison and Myers (6), has led to the careful 
forecasting of manpower needs and the close articulation of higher education 
opportunities with the national needs and interests (3, 7). 

This precise predetermination of needs and opportunities is especially well 
illustrated in Tanganyika Five Year Plan for Economic and Social Develop- 
ment (9) and its accompanying Survey of the High-Level Manpower Require- 
ments and Resources for the Five-Year Development Plan, 1964-65 to 1968-69 
(2). The Survey classifies manpower requirements into а number of levels 
and occupations within those levels, estimates the number of individuals re- 
quired in those categories with which it is concerned; and the government 
undertakes to supply those manpower needs by providing the requisite educa- 
tion to the projected numbers of individuals. When the underlying economic 
assumptions are considered, this approach is attractive although it is not univer- 
sally accepted, even by economists (1). However, it does have certain 
and psychological implications that have seemingly not met with detailed 
analysis. The author has singled out one of these for consideration in this report. 

Education is virtually the only means through which a young person can 
upgrade himself socioeconomically in the developing countries. If the govern- 
ment has the almost exclusive and unchallenged right to decide whether а 
student will go on for postsecondary training—and what training he will 
receive—and the student's sole decision is whether or not he will accept the 
Eovernment's prescription, individual preferences and priorities become sub- 
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ordinated to national interest. The student will usually accept the opportunity 
for further schooling, although it may not be in an area in which he is 
primarily or sometimes even remotely interested, because he cannot afford to 
relinquish his chance at a higher education and the significant economic 
advantages it confers. 

To the extent that national needs and the occupational interests of second- 
ary school students coincide, problems of morale arising out of educational 
placements in which the individual and the planners conflict will be minimal. 
However, if the interests of individuals and the needs of society differ mate- 
tially, individual students will follow courses of study in which they are not 
primarily interested; and problems of motivation and morale will likely re- 
sult. Schultz, in discussing the social implications of his analysis, observes: 


There are many hindrances to the free choice of professions. Racial dis- 
crimination and religious discrimination are still widespread. Professional 
associations and governmental bodies also hinder entry; for example into 
medicine, Such purposeful interference keeps the investment in the form 
of human capital substantially below its optimum. (8, pp. 13-14). 


The high level manpower needs of Tanzania have been asserted but the 
occupational interests of secondary school students have not previously been 
studied. A main purpose of this investigation was to furnish data on those 
interests and to relate them to some of the allocations in the Manpower Plan. 

Another principal objective of the study was to ascertain differences in atti- 
tudes toward occupations as these were associated with the tribe or community 
to which the student belonged. It is commonly believed that these groups vary 
greatly in intellectual attainments, “need achievement," and other qualities 
which may be reflected in the way in which occupations are valued. The data 
provided a means of assessing some of these beliefs insofar as they relate to 
secondary school boys and girls coming from more or less “traditional” 
settings. 


B. METHOD 


In June and July, 1966, approximately 3500 Tanzanian students in the 
first three years of secondary school were surveyed, The field work was done 
by University College students who were in the secondary schools as practice 
teachers and who, after appropriate training, carried out this task as part of 
their duties. All of the subjects completed an intelligence test and answered 4 
questionnaire adapted from Gillespie and Allport's Youth Outlook Survey 
(4). The students studied were enrolled at 31 different secondary schools 


E. L. KLINGELHOFER 151 


throughout mainland "Tanzania and represent a sample of roughly 20 per cent 
of all students enrolled in the first three forms of secondary school. 

One survey question had the subjects rank, in order of preference, a number 
of occupations. Boys responded to a list of 20 and girls to one containing the 
names of 13 occupations. These are named in Table 1 below. The occupations 
appearing in the lists were selected primarily because of their familiarity to 
the student and because they represented a variety of types and levels of work. 

The rank responses of the students to this question were tabulated accord- 
ing to the sex, secondary school form level, and tribe or community of the 
respondent. Four Ásian communities and 23 African tribes had sufficient num- 
bers of subjects for separate analysis. After being tabulated, the raw ranks 
were transformed to C values according to the method described by Guilford 
(5, pp. 182-183). The C values were used to calculate mean normalized re- 
sponse values for each sex, form, and tribe or community. The resulting mean 
values vary directly as the rank preference: i.e., the lower the mean response 
value for an occupation, the lower the preference for that occupation. "The 
mean for all C values is 5.0 and the distribution is “exactly symmetrical” (5, 
p. 183) around the mean. 


C. RESULTS 


Altogether, 3146 students—2096 boys and 1050 girls—gave useful rank re- 
sponses to the occupational lists. "The mean normalized response values for the 
ranks were initially calculated for each sex, form, and tribe and the result 
examined for trends from Form 1 to Form 3. Among the four Asian male 
groups, only one occupation—Traditional farmer—showed any tendency for 
mean response value to decrease in the higher form levels; hence the results 
for the forms for each Asian community were consolidated. Among the African 
male tribal groups, there was a tendency for mean response values for the 
occupations of Accountant and Primary school teacher to increase slightly and 
those for Doctor, Officer in defense forces, and Politician to decrease, but the 
Changes were not great nor were they consistent over all tribes; so the forms 
for each tribal group were also consolidated. Ў: 

Next, the mean normalized response values for the Asian communities were 


? The 1 ructure in Tanzania is English-inspired. Primary school em- 
braces ihe pepe SMS of schooling (Standards I-VII). Secondary school consists 
9f six years of postprimary education; the first four years (Forms 1-4) culminate in 
the Cambridge ordinary level examinations. The candidate successful in these goes on 
to Forms 5-6, If he passes his Advanced Cambridge examinations at the end of Form 
6, he earns a Higher School Certificate, which is regarded as equivalent to one year 
9f collegiate study in the United States. 
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compared with one another. They showed a strikingly close resemblance and 
permitted the investigators to combine the Asian communities into one large 
group for each sex. A similar analysis for the African tribal groups had the 
same result. There seem to be no meaningful differences in the ways in which 
the various groups—within the Asian or African framework and, of course, 
according to the sex of the respondent—regard the occupations ranked.? 

The mean normalized response values of the combined Asian and African 
groups were then compared for the sexes separately, and are presented in 
Figures 1 and 2. Both the figures should be read in the same way; for example, 
the mean normalized response value given to the occupation "Barber" (the 
occupation least preferred by both groups of boys) was 1.6 for Asians and 
2.4 for Africans. 

The Pearsonian product-moment correlation between mean response values 
for occupations ranked by African and Asian boys is -L-.96. This indicates 
that an extremely close agreement exists between the mean values calculated 
for the ranks assigned by the two groups. The African boys tend to show less 
homogeneity in their ordering so that their range of mean values is somewhat 
smaller; when the mean response values are reconverted to simple ranks, the 
Asians assign relatively higher preferences to Accountant and Motorcar me- 
chanic, while the African boys are inclined to give a comparatively higher 
place to Radio announcer and Secondary school teacher. These differences— 
the only ones of any consequence—may reflect the influence of cultural factors 
and meet with comment in the following section. 

"The mean response values for girls are represented in Figure 2. The cor- 
relation between occupational means is: --.79, which also reflects a close agree- 
ment between the values calculated for the two groups. The African girls, as 
their male counterparts, also show more heterogeneity in their preferences for 
any occupation so that the range of means is somewhat smaller than that shown 
by the Asian girls. Converting mean values to ranks for the two groups and 
comparing the relative popularity of the occupations reveals that African girls 
show relatively higher preferences for Clerk in an office and Radio announcef, 
while Asian girls favor Nurse, Primary school teacher, and University teacher. 
This may also reflect cultural factors, as we have suggested below. 

To relate occupational preferences to national manpower needs—a major 

8 The tables giving mean normalized response values, by sex, for the African tribal 
groups and Asian communities have been deposited with the American Documentation 
Institute. Accompanying the tables are brief descriptions of the tribes and communities 
and a map locating them in Tanzania. A copy may be secured by citing Document 
number 9368 and by remitting $2.50 for photoprints, or $1.75 for 35-mm microfilm. 


Advance payment is required. Make checks or money orders payable to: Chief, Photo- 
duplication Service, Library of Congress, Washington, D.C, 20540. 
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concern of this study—one additional analysis was conducted. The number 
and per cent of times each occupation was ranked as a first or second choice 
was determined for each sex. The results of this are reflected in Table 1. 
For the boys, Doctor and Engineer, combined, collect 48.7 per cent of the 
first ranks with Lawyer and Modern farmer getting 7.8 and 6.9 per cent re- 
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FIGURE 1 
MEAN NORMALIZED RESPONSE VALUES OF 1795 AFRICAN AND 301 Astan SECONDARY 
ScuooL Boys TO 20 OCCUPATIONS, TANZANIA, 1966 
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spectively. Doctor, Engineer, and Lawyer are also the most popular 
choices followed by University teacher, Secondary school teacher, and Mo 
farmer. The girls give the bulk of both first and second choices to Doct 
Airline stewardess, Radio announcer, and Nurse. We have attempted to 
solidate and relate these first and second choices to the requirements 
Tanzania's Five Year Plan in Table 2. This appeared to be ап approp 
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step to take because over 80 per cent of the students named the occupation 
to which they gave first preference when responding to the question: “Name 
the kind of work you think you will most probably be able to take up." ‘There 
is considerable overlap between the aspirations and the expectations of the 
students. 

Table 2 reveals that the most pressing needs in the manpower plan are for 
individuals to fill what are designated Category B or C occupations: those 


TABLE 1 
NUMBER AND Per Cent оғ Times SELECTED OCCUPATIONS WERE RANKED Frast on 
SECOND IN ORDER OF PREFERENCE BY A SAMPLE OF 2096 Bors AND 1050 
GIRLS ENROLLED IN TANZANIAN SECONDARY SCHOOLS, 1966 


i. ee 

ris 

Occupation * % % 
Accountant 52 49 
Airline stewardess 132 11.2 
Barber 2 — 
Carpenter л 6 
Clerk in an office 21 8.7 23 53 
Doctor 18.8 29.0 18.2 1357 
Dressmaker 6 1.5 
Engineer 29.9 14.9 
Factory supervisor 24 43 
Factory worker J 1.0 
Housewife 1.5 1.6 
Laborer 2 -— 
Lawyer 78 9.0 
Modern farmer 69 5.7 
Motorcar mechanic 13 73 
Nurse 9.8 18.9 
Officer in defense forces 4.2 4.2 T: 
Policeman (woman) 5 11 1.5 
Politician 34 53 ri 
Primary school teacher 2.7 3.8 23 M 
Radio announcer 3.7 9.5 Pr Yi 
Secondary school teacher 52 7.6 2 
Secretary 65 * 
Shop sales girl 3 12 5 
Traditional farmer 1 65 $0 


requiring secondary schooling plus some additional training or experience. 
Indeed, Clerical workers and Primary school teachers combined account for 
more than half of the total projected needs; and Category A jobs (those re- 
quiring University training) make up only about 11 per cent of the total. 

The discrepancy between national needs and individual preferences is pointed 
up by comparing Columns 1 and 3 of Table 2. The boys and girls obviously 


and understandably are pushing toward traditional professional occupations, 
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regardless of the needs, and give little consideration to most of the subprofes- 
sional, clerical, or skilled manual occupations. 
If the students in our sample are taken to be representative of the roughly 
20,000 students in Forms 1-4 of secondary school—and these are, in many 
TABLE 2 


HiGHER LEVEL MANPOWER NEEDS AND OCCUPATIONAL PREFERENCES OF 
SECONDARY ScHOOL STUDENTS IN TANZANIA 


Required in 


manpower survey Student first or 
Category? % of % of second choices 
and occupation total category Jord 
Category A occupations 
(М = 2905) 10.9 
Accountant, professional ES 4.0 10.1¢ 
Engineer (all types) 1.3 12.0 49.6% 
Lawyer K 2.8 16.7¢ 
Physician 1.2 11.2 38.8e 
Secondary school teacher 2.6 244 12.7e 
University teacher 3 3.0 13.4e 
Category B occupations 
(N — 6555) 24.5 
Accountant, noncertif, 1.0 41 — 
Nurse, professional К) 37. 8.94 
Nurse, staff and duty 3.3 13.3 28. 
Radio announcer a 4 12.5° 
Primary school teacher 10.1 413 6.0е 


Category С occupations— 
skilled clerical (N = 13,282) 49.7 
Stenographers-secretaries 14 29 16.14 
Clerical workers 41.1 82.7 8.8e 


Category C occupations— 
skilled manual (N — 3980) 14.9 
Automobile mechanic 2.9 19.8 8.6¢ 


* The categories of occupations are given in the Plan as follows: Category A in- 
cludes jobs which normally require a University degree; Category B consists of jobs 
requiring 1-3 years of formal postsecondary (Form 4) education; Category C con- 
tains jobs that normally require secondary school education for standard performance 
of the full array of tasks involved in the occupation. Altogether the total manpowe! 
needs are estimated at 26,722 individuals. А 

b Student percentages are expressed as percentages of the total number of individ- 
uals, Irrespective of sex, designating the occupation a first or second choice. 

© Occupations responded to by 2096 boys. 

4 Occupations responded to by 1050 girls, 

© Occupations responded to by both boys and girls. 


cases, the same individuals who will be recruited into Category B and C occu- 
pations by 1969—or if they are assumed to reflect the preferences of students 
now farther along in the educational pipeline, all Category A occupations 
would be overchosen (in relation to needs) by a factor of at least six, except 
for Secondary school teacher. In Category B there should be a fairly close 


E. L, KLINGELHOFER 157 


match between need and demand for nurses, but the preference for Radio 
announcer is probably 75 times greater than the need, while Primary school 
teacher will be greatly undersubscribed and quotas will be filled with individ- 
uals not strongly motivated to that field. This general statement is also true 
for the Category C clerical occupations. There seems to be a clear imbalance 
between student preferences and the trained manpower requirements of the 
country and this imbalance may be expected to have a bearing on the success 
of the manpower plan. 


D. Discussion 


The results indicate that there is a fairly even level of interest on the parts 
of boys and girls in the various occupations ranked over the first three forms 
of secondary school. Asian and African boys show closely similar patterns of 
preferences and girls drawn from these two groups also agree in their prefer- 
ences, ‘The reason for this consistency probably reflects, in two ways, the influ- 
ence of the secondary school. The number of places is quite small, Less than 
4 per cent of the school age population is able to enter Form 1 and only about 
12 per cent of applicants eligible to enter Form 1 are accepted. Because of 
the strict competitive selection, it is unlikely that there would be any differ- 
ences within the subgroups comprising the African or Asian samples. The rigor 
of the process of selection would standardize the intake so that differences 
between ethnic subgroups would be unlikely. The close similarity of occupa- 
tional preferences of Africans and Asians within sex groupings may reflect the 
assimilation of similar sets of values in the schools. The secondary educational 
system has been superimposed on the traditional cultures and even now con- 
tains a significant number and proportion of expatriate teachers. It is certainly 
plausible to speculate that European values have been assimilated and have 
largely determined the order of preferences which go heavily toward the 
higher professions. Where there are differences—Asian boys show relatively 
stronger preferences for Accountant and Motorcar mechanic, while African 
boys give relatively more prominence to Radio announcer and Secondary school 
teacher—these may be attributed to socioeconomic factors differing for the 
two groups. Asians own and operate most of the retail businesses in the coun- 
try—apparel, food, hardware, automotive sales, service, and repair, etc.—and 
this feature of the Asian community probably opens up Accountant and Motor- 
car mechanic as familiar and concrete occupational possibilities to Asian youth. 
The preference of African boys for Radio announcer may reside in the fact 
that the radio is one of the two or three most sought after personal possessions 
and constitutes the most important sources of news and entertainment for the 


158 JOURNAL OF SOCIAL PSYCHOLOGY 


African population of Tanzania (10). Preferences here may simply 


about the duration of work after leaving school. The Asian girls are thoug 
to be more likely to continue in employment after marriage; and the pro 
of long-term employment may lead to the development of a preference 
occupations that are more nearly professional, provide future promise, | 
offer stability. African girls have not tended to continue in employment ft 
marriage and this expectation may act to encourage the choice of occupati 
involving less formal training or offering more widespread social contacts. 
In general the occupational objectives of the African groups—boys and zi 
both—were more modest than those of the Asian groups. On the whole th 
rated less desirable work higher than their Asian counterparts, and did m 
rate so highly the more difficult or demanding jobs, even though the ordei 
preference was very close, 1 
It is especially interesting to consider some of the implications of these d 
for the Five-Year Development Plan in Tanzania. The Plan and its associati 
Manpower Report, as we have seen in the preceding section, provides for tl 
training of "high level" manpower and indicates the number of individu: 
required in certain professional, technical, and skilled occupations. 1 
From the data in Table 2 it seems clear that national needs and individu 
preferences of secondary school students are badly mismatched. The stude 
are overwhelmingly oriented toward high prestige professional occupation 
while the country requires clerks, skilled manual workers, teachers, and nurs 
"These needs probably cannot be met without applying considerable social prt 
sure or control; and the necessity of using this sort of stratagem has serio! 
implications for the motivation of individuals recruited into these occupatio 
their occupational life expectancy after training, and the caliber of their p 
formance while in the job. Their rates of failure and voluntary attrition wi 
probably be higher than those for individuals engaged in the same occup: 
through choice, and will be higher than anticipated by the manpower plan 


E. Summary 


In this investigation the rank order of preferences of 3146 Tanzanian s 
ary school students for several different occupations was studied. The lev si 
preference for each occupation was expressed as a mean normalized respon 
value. These data were presented and analyzed by sex and by African tri 
or Asian community. Similarities of interests for the individual sexes, irresp 
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tive of African tribe or Asian community, were considered; and the implica- 
tions of these preferences for some aspects of the manpower planning features 
of Tanzania's Five-Year Plan are suggested and discussed, 
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FREE VERBAL ASSOCIATIONS* + 


Center for Research in Social Systems, The American University 


Lorann B. Szatay AND Jack E. Brext? 


A. INTRODUCTION 


Floyd Allport, in discussing the pervasive importance of meaning in percep- 
tion and cognitive processes, concluded that "aside from purely automatic 
habits and reflexes, there are scarcely any parts in the whole field of behavior 
into which meaning does not enter” (1, p. 535). From the viewpoint of the 
psychologist, meaning emerges in the form of a mediating reaction, ты (36), 
or, less technically expressed, as a decoding or encoding reaction frequently 
characterized by highly individual and subjective content. The coding reactions 
are viewed as being conditioned by experiences that are in part peculiar to 
each individual and in part shared within groups. The psychologist’s concern 
derives from the influence exerted: by. subjective and group meanings on indi- 
vidual and group behavior in general and on interaction and communication 
in particular. While no dictionaries show the meanings that Christian Scientists, 
drug addicts, and pharmacists, for example, have for “drug,” recognition of 
their different meanings is needed to understand their different behavior. 

The experimental work of the last decade shows a heavy focus on the use 
of the Semantic Differential technique introduced by Osgood and his asso- 
ciates (36). Stimulating extensive cross-cultural investigations, the Semantic 
Differential has produced rich empirical data on the evaluative-attitudinal 
meaning components identified as connotative meaning. The limitations of this 
instrument have been repeatedly acknowledged, however, by Osgood, who 
argues for the need to improve research methods for a broader, more inclusive 
empirical information on meaning: “It is clear on purely intuitive grounds 
that the concepts HERO and SUCCESS, which have similar Semantic Differ- 
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ential profiles, do not mean the same thing. It is therefore important for studies 
of the behavior of culturally heterogenous groups to develop additional proce- 
dures for the measurement of similarity of concept meaning" (14, p. 1). 

In the scarch for such procedures, the use of free verbal associations emerges 
as an important approach. Free verbal associations are widely used in investi- 
gations on verbal learning and verbal behavior. Although most investigators 
avoid references to meaning, reported results do indicate that frec verbal asso- 
ciations offer empirically well-founded information on various meaning aspect 
Noble's “m” (33), for example, provides a reliable and valid measure of 
“meaningfulness.” 

Also, a variety of measures have been derived which quantify the relation- 
ship of stimulus words based on shared responses (8, 11, 12, 15). Recent in- 
vestigations indicate that these measures based on commonality in association 
content yield results that are in good agreement with results obtiined with 
similarity measures (8, 14, 18), suggesting that the amount of shared asso 
ciations reflects the degree of meaning similarity. Deese (11) introduced the 
idea of defining “associative meaning” operationally as “the distribution of 
responses” obtained in free verbal association tasks. Factor-analyzing relation- 
ship clusters inferred from the overlap of associative response distributions 
factors were obtained, which appear to reflect genuine meaning components. 
The data reported by Laffal and Feldman (25, 26) provide additional empiri- 
cal evidence in support of these findings. 

In a different context authors refer to cultural norms which they derive 
from associations. Russell and Jenkins (39) developed associative response 
norms on the basis of the standard Kent-Rosanoff stimulus list and inter 
preted these norms as “standards of the verbal culture.” Cohen, Bousefield, 
and Whitemarsh (10) derived "cultural norms” using collective nouns, cate- 
gory names (e.g., tree) as stimuli. They obtained single response items (maple, 
oak, pine) in proportions that seem to be characteristic for the United States 
subjects used in these experiments. Up to now, no comparative study has been 
done using associations of culturally distinctly different groups. The above 
findings of the experimentally oriented literature suggest, however, the culture 
specificity of free verbal associations and their relevance for measuring cultural 
meanings. 

B. THEORETICAL CONSIDERATIONS 
1. A Model of the Meaning-Mediated Association Process 


In current theoretical explanations for meaning, the two-stage models 21 
broadly used (32, 36). They emphasize that the word as a stimulus does not 
elicit overt behavior directly but only through a mediating or decoding reaction 
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This decoding reaction commonly called meaning is conditioned to the words 
by learning—that is, experience. It is suggested that the very same rationale 
holds for the process of association. 

The association may be viewed as a process embracing three sequential 
events or stages that are introduced with the presentation of the stimulus 
word, 


I e II * m »R 
— — À —» fm ————— P» fae -—H 
decoding 7 =" elicitation ~ "^ encoding 
While the decoding and encoding stages are self-explanatory, the elicitation 
stage, deriving from the decoding reaction, requires more discussion. ‘The elici- 


tation, or what is really the event of association in a stricter sense, emerges from 
the decoding or meaning reaction. Accordingly, it may be expected that this 
process of clicitation is influenced or even controlled by the content or char- 
acteristics of the decoding or meaning reaction, Tm. While little is known 
about this reaction in neurophysiological terms, it is generally agreed by the 
partisans of the two-stage models (32, 36) that the meaning represents a 
complex reaction composed of a number of simultaneous, coordinated, neuro- 
logical events? At the more common sense level, the meaning of a word is 
composed of a number of components. 

Thus, the elicitation is interpreted as a process in which associative responses 
are mediated by the various elements of the decoding reaction. It is assumed 
that responses are elicited that have meaning elements identical with the 
mediating fraction of the stimulus meaning—that is, responses are produced 
that have meaning elements in common with the stimulus meaning. Р 

These elements with their relative salience constitute what may be identified 
35 subjective meaning content. This visualization of the meaning reaction as 
à bundle or mosaic picture of meaning elements makes it plausible to assume 
that its mosaic elements, rather than the meaning reaction as a whole, control 
the elicitation process and lead to particular responses. When a single associ- 
ative response is requested, it is likely that the most salient element will bring 
about a response. When more than one response is requested, it is likely that 
the responses obtained will represent the products of more than one meaning 
elment. The most salient meaning element starts the response Production ; 
the less salient meaning elements elicit the later responses. Thus if continued 
association tasks are used, the sequence of the response may be expected to reflect 
tank order of the meaning elements in subjective salience. The individual 
= 


2 “Th i ign is conceived to be a simultaneous bundle of distinctive 
semantic aeae с еы ү Т with component r's of the total г”; said Osgood 


Most recently in discussing the nature of the meaning reaction (35). 
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response repertoire obtained will contain the main elements of the stimulus 
meaning. In other words, the individual response repertoire of a given person 
is expected to include the main mosaic elements of the stimulus meaning in 
proportion to their salience for the person. 

The group meaning may be conceived as an aggregate of elements shared 
by members of the group, and the shared meaning elements may be roughly 
identified on the bases of the shared responses. The mosaic picture of the 
group meaning for a word may be expected to emerge from a response list of 
shared associations given by the members of the group in continued free verbal 
association tasks. 


2. The Reconstruction of the Stimulus Meaning from Associations 


The assessment of the group meaning requires the identification and pro 
portional representation of the meaning fractions from the associative responses 
provided by the group. It involves three types of tasks: 

a. Identification of elements that the single responses share with the stimulus 
meaning. The response words, just like the stimulus words, have subjective 
meanings represented by subjective coding reactions. These meanings are also 
characterized by a composite or mosaic-like meaning content. The identification 
constitutes a task aimed at the separation of those elements, the mosaic parts 
of the response meaning that are identical with the corresponding elements of 
the stimulus meaning. Such identification of meaning elements would require 
the assessment of the subjective meaning of each response word—a much #00 
demanding task. Operating at a group level, we may take recourse to a sim 
plification, however, and reduce the task of identification to a manageable prob- 
lem. Instead of being concerned with subjective meaning of the respondents, 
we may use the simple lexical meaning (or what is frequently called the sig 
nificatum) of the response words. By relating this to the stimulus meaning, 
we may look then for elements: that is, relationships that connect the respons 
with the stimulus. What we are looking for here emerges if the content of 
the response word is considered in the context of the stimulus word. 

Most of the responses to free verbal associations suggest quite clearly © 
single (or a few) relationship alternatives. Thus, subjects, who are instructed 
to order responses according to their content-determined relationship to the 
stimulus word, perform this task with high reliability: that is, they аге if 
agreement about the relationship of these responses to the stimulus (605 
hungry). They agree that some of the responses (e.g., milk, meat, fruit) 
belong to one particular relationship category (food items instrumental ™ 
drive reduction); other responses (e.g., eat, swallow) to another (action 
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aiming at drive reduction). Subjects are also in substantial agreement in iden- 
tifying the words (e.g., municipal, hazardous, dictionary) for which no plau- 
sible relationship is apparent. The identification of a relationship requires 
focusing on a particular meaning element of the broader response meaning. 
For the high frequency responses, it is especially easy to find a plausible rela- 
tionship alternative and to identify the elements they share with the stimulus. 

b. Organization of single meaning elements in broader components. Re- 
sponses similar in content (e.g, milk, meat, fruit) and characterized by 
analogous relationships to the stimulus (hungry) lend themselves to be com- 
bined into a few higher-order categories (e.g. food varieties). Using such 
categories allows one to reduce the many different responses into a few main 
meaning components. 

The relationship of the category to the stimulus becomes self-evident when 
the category is considered in the context of the stimulus word (e.g., food 
varieties—hungry). The identification thus does not require separate inferences 
ot even additional formulation ; it automatically emerges from the simultaneous 
presentation of the categories (food varieties, ways of consumption) in the con- 
text of the stimulus (hungry). 

The relationship of such categories to the stimulus is so common and obvious 
às to appear pedestrian. This impression of banality, however, disappears when 
data from groups with different backgrounds are compared and when stimulus 
words of popular interest (e.g., social issues) are used. 

c. Salience of the meaning elements and components. We usually have 
fairly clear ideas about the main components of meaning or at least we are 
able to list a vast array of potential components. The critical question is not 
whether these are components or not, but how much is each component charac- 
teristic for a particular group. Ralph White (45) demonstrates that for the 
Meaning of CAPITALISM, the critical issue is the differential salience of 
Meaning components various nations have; for instance, the priority of the 
economic components of capitalism over the social components for Americans, 
and the priority of social over economic components for Latin Americans. 

At group level, the salience of a mosaic element derives from the sum of 
individual salience values obtained from vatious group members. Thus, in 
order to determine group salience, we have to know the individual salience of 
elements. As indicated earlier, the individual salience of a meaning element 
is assumed to be reflected by the rank place the response was given. 

Garskof and Houston (15) argue that limiting association tasks to the first 
Tesponse is arbitrary and they propose the use of continued association tasks 
for studying individual response hierarchies, They clearly identify the problem 
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of the differential importance or salience of associations emitted at 
places in the individual sequence. 
The approach adopted by the présent authors is based on the as 
that the subjective salience or importance of a response is reflected by tl 
bility of these responses reoccurring as associates. In order to derive wei 
scores empirically, subjects (Ss) were given the same free association ta 
week apart. The percentages of first test responses (by place of emissió 
that reoccurred in the retest of the same subjects were calculated. The s 
coefficients obtained provide the following series of empirical weighting 800 
for place of emission, mean percentage of stability, and weighting score, 
spectively: I, .60, 6; 15.48, 5; III, .42, 4; IV, 34, 3; V, .32, 3; VIG 
_ 3; VII, .25, 3; VIII, .20, 2; IX, .15, 2; and X, .11, 1. These weighting si 
allow us to assign weights to responses obtained at various places of emiss 
and to express the salience of the meaning element that the single res 
contribute to the whole meaning mosaic. The group salience of elements 
components may then be determined by a simple sum of individual salien 
values. 


C. THE PROBLEM 


The purpose of this study is to explore the potential utility of free v 
associations for the empirical assessment and the quantitative analys 
cultural meaning content. Cultural meaning is conceived here as a compos 
content derived from the summation and proportional representation of t 
elements of the individual coding behavior of members of culturally hom 
geneous groups. 

The study of cultural differences in meaning reaction is not only a relev 
field of application, but is also the best testing ground for a research app 
derived from the above conceptualization of the association process. In 
to the purely laboratory types of experiments, this study explores the 4 
ment of associative data on cultural meanings with well established cu 
facts as well as with various criterion measures. 

The empirical approach of collecting native language associations is deriv 
from the above conceptualization of the associative process. This apP 
pursued the objective to promote the production of meaning-mediated 
ciations, which are expected to carry elements of the stimulus meaning, 
provide empirical data for its reconstruction. It is well known that the 
tions of the administration of the association task influence whether 0! 
meaning mediated responses will be obtained. 

The literature is rich in data on various conditions influencing 495091490 
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response production (4, 9, 21, 40, 41) and suggests that the following con- 
ditions would increase the amount of meaning-mediated responses, producing 
more response words which reflect the stimulus meaning. 


1. Reduced Time Pressure 


Urging $ to respond immediately is likely to increase the portion of verbal 
habit-determined responses, whereas providing $ with more leisurely conditions 
is likely to increase the importance of the meaning content of the stimulus word. 


2. Continued Association E « 


The first immediate response is somewhat more likely to be mechanically 
determined than are later responses, which, being less direct, result increasingly ~ 
from cognitive processes attached to the meaning content of the stimulus word. 
This assumption is supported by experimental and theoretical considerations 
on the comparison of single versus continued association tasks (9, 15). In con- 
tinued association tasks with one-minute time limitations, the share of first 
Tesponses is only a minor fraction (1/6 to 1/10) of the total response 
Production, 


3. Written Form 


A writing task may be expected to increase the amount of content-determined 
Tesponses for two reasons: First, in the face-to-face situation, $ may feel a 
Pressure to respond quickly even though the experimenter makes no conscious 
effort to create such pressures. Second, vocalization may lead to acoustically 
Conditioned responses where the sound pattern plays a more important role than 
Meaning content. 


4. Group Administration 


Giving the task to more than one person at a time should remove some of 
the stress factors at work when S is alone with the experimenter. Group ad- 
ministration of the task should prove the least stressful, especially where there 
5 а guarantee of individual anonymity. 

Four main hypotheses were tested in this study: ; 

l. Free verbal associations of different cultural groups will yield articulate, 
group specific, and consistent associative responses, providing bases for the 
“mpirical assessment of group specific meaning content. . Г 

2. Differences between cultural groups in associative meaning content will 


"тее with basic cultural facts and knowledge. 4 : J 
З. Inferences about cultural meanings based on associations will agree with 


168 JOURNAL OF SOCIAL PSYCHOLOGY 


criterion measures of cultural perception of meaning content (judgment of 
meaning similarity). 

4. Inferences about an attitudinal-evaluative meaning component based on 
association will agree with criterion measures on this attitudinal component 
provided by the Semantic Differential. 


D. METHOD 


Data from members of three quite different cultures on a variety of aso 
ciational and other psycholinguistic tasks were obtained in order to determine 
the utility of the associative approach in describing and measuring cultural 
differences in meaning. 

Subjects represented three cultural groups—the United States, Korea, and 
Colombia—who were similar in educational level, socioeconomic status, and 
sex distribution. The 50 Ss in each group came from universities in the Wash- 
ington, D.C., area. 

The original English word list contained a greater number of words (са. 
70) belonging to various reference areas. This list was given to three Korean 
English and three Spanish-English bilinguals for translation. The bilinguals 
worded independently, and the English words for which different translations 
were obtained within the same language were examined. If the different trans- 
lations suggested difficulties in finding a single unequivocal translation, the 
English word was dropped from the word list. This strategy was adopted in 
order to maximize comparability in the context of the present methodologically 
executed study.* А 

By this method а final stimulus list was obtained containing 48 stimuli 
representing four main areas of meaning. Each meaning area included threë 
different word categories: nouns, adjectives, and verbs. The words had to fit 
into the category system and to have approximate translation equivalents in 
the three languages. 

In order to shorten the report, discussion of results was limited to the follow- 
ing stimulus words representing reference spheres I (food) and VI ( family)- 


* The authors are inclined to conclude that perfect stimulus equivalence rarely 
exists in cross-cultural investigations. Even visual stimuli—e.g., pictures—are di 1 
ently perceived and do not mean the same for people with distinctly different cultut® 
backgrounds. Thus, instead of adhering to the principle of stimulus equivalence 
may be more fruitful to aim for the closest available translation equivalents. 
distance separating so-called untranslatable foreign terms—like esprit, Weltanschauung 
simpatico, partiinosti—from the closest available English translations is neither 3° 
dental nor culturally irrelevant. These words are usually culturally most characteriste 
and thus important in substance-oriented cross-cultural investigations. The dependent? 
on the semantic peculiarities of single words may be reduced by using an increased 
number of stimulus words from the same meaning area. 
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following translations (Figure 1) were used for the eight stimuli words 
d at certain length in our report: 

is report contains only experimental results from free association tasks, 
rity judgment tasks, and the Semantic Differential. 


1. Association Task " 


h group was given a continued free verbal association task in its native 
дс. The task was administered in written form on cards with a stimulus 


Colombian Korean 

Hambriento Baegopun 

Arroz Tsal 

Alimento Ünsik 

Comer Moknunda » 

Filial Hyohaeng hanun 

Familia Kajok 

Ancestor Chosang 

Respetar Chorigy ong handa 
FIGURE 1 


1 Tus STIMULUS Worps Usep IN THE NATIVE LANGUAGES 

E Korean words are given here in transliteration. 

fd printed at the top and with lines for the corresponding associations pro- 
d below. 

l subjects received the same stimuli in different random order and were 
у ed to avoid “chain” responses (eg. writing, pen, ink, blue, ocean, sail, 
he stimulus TABLE), and to give only responses directly connected to the 


2. Similarity Judgment Task 


\ agreement with the practice frequently reported by the literature (2, 13), 
hilarity judgment was used in these experiments as a criterion measure of 
tive meaning content. In the present study, Ss chose from a selection 
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list of 11 words the one perceived to be most similar to the reference word, 
All words belonging to the same reference area were presented on a single 
card. The words of the selection list were printed in a circle around the refer 
ence word at approximately equidistant points to control for positional effects, 
Ss were told to read the center word, then the words ringed around it, and to 
"encircle that one word which you evaluate to be the most similar in meaning 
content to the word in the center." Cards were presented in random order 
with a time limit of 20 seconds allowed for each selection. 


3. The Semantic Differential Task 


Another criterion measure used was derived from the Semantic Differential: 
composed of scales representative for the three main factors of evaluation, 
potency, and activity (36). The Semantic Differential was administered 08 
the same words included in the stimulus list. The scales and the standard 
instructions used by Osgood and associates were administered in S's native 
language. 


E. RrzsULTS AND DISCUSSION 
1. Consistency and Culture Specificity 


In the continued free verbal association task, the members of the United 
States and Colombian groups produced an average of about six words. The 
Koreans averaged about five words, because the Korean alphabet takes longtf 
to write. The individual repertories have many responses in common 
can be summed and ordered into group response lists. The scores used as # 
basis for ordering reflect both the frequency of a response and its rank plac 
of emission. Group lists averaging about 300 responses were obtained, involv- 
ing from 150 to 200 different response words per stimulus. The number of 
words given by at least two persons in a group of 50 is substantially less (40- 
70); but as these shared responses are given frequently, they account for * 
great proportion (60-75 per cent) of the total response production. The grou? 
response lists, starting with the high score and ending with the low sf 
items, offer a clear priority or salience order of responses (see Table 1). 

For each group, distinct priorities emerge which become increasingly mean- 
ingful if considered within categories of homogeneous items (e.g., varieties 
food or varieties of countries). Within the context of the stimulus HUNGRY 
for example, the Korean responses dealing with food offer the following 014 
of priorities: cooked rice 107, food (in general) 51, rice 44, meat 11, soup 1% 
and so on. While this salience order is distinct, it raises the question of whethe! 
such priorities are accidental or whether they reveal consistent and general 
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TABLE 1 
Tur 20 Hicnest-Scorinc ASSOCIATION Responses то THE $тїмишз HUNGRY 
U.S. subjects Korean subjects Colombian subjects 
Responses Scores Responses Scores Responses Scores 
Food 220 Cooked rice 107 Meal 107 
Eat 76 Beggar 100 Food e 73 
Thirsty 61 Food $1  — Hunger І 65 
Starve 59 Poverty 46 Роог 59 
Stomach 52 Rice 4 Beggar 43 
Poor 43 Money 40 Poverty 35 
Pains 27 People/person 35 Money 36 
People 18 Poor 28 Bread 35 
Famished 18 Pitiful 27 Starving 29 
Needy 17 War 26 Thirsty 29 
Diet 16 Child (ren) 23 Work 27 
Meat 15 Orphan(s) 22 Necessity 18 
Money 14 Eat 13 Thin 17 
Always 14 Time 12 Man 15 
Cake/pie 12 Stomach n Miser 13 
Tired 11 Meat 11 Rich (ness) 12 
Help 11 Soup 10 Stomach 12 
Communism 11 Sympathy 10 Meat u 
Cook 11 Poor people 10 Laziness n 
Beggar 11 Must eat 10 Fast u 


MUS ee 


izable patterns. To explore the problem of consistency we may compare the 
priority orders obtained among the four food-related stimulus words. As ap- 
pears in Table 2, there is remarkable consistency for the Korean group. The 
priorities of responses to the four food-related stimulus words for the other 
cultural groups reveal similar consistency for food responses as well as for 
other classes of responses—persons, places of consumption, and so on. The long 
Tesponse lists, including responses, belong to a relatively few main relevant 
content categories and suggest the value of organizing responses on the basis 
of their content. Responses of similar content can be grouped and their scores 
summed into a single category score. The main content categories suggest 
main components of content with distinct relative proportions. Table 3 shows 
the category composition of responses to HUNGRY for the three cultural 
groups. , 

"The response lists reflect stable priority orders of single response items, and 
the category lists suggest content in terms of main components; together they 
Teveal consistent and generalizable characteristics of cultural groups as well 
às differences among them. { Р 

А variety of measures of the relationship between words exists as reviewed 
by Marshall and Cofer (30), based on the associative overlap. For use with 
the continued association task and in agreement with the conceptualization of 
the meaning-mediated association process, а new overlap measure was developed 
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called the Index of Associative Affinity. It expresses the associative overlap or 
similarity based on the relative portion of the scores of overlapping responses 
elicited by two stimulus words. To calculate the extent to which one stimulus 
word (A) is related to another (B), the scores of the associations in common 
for A and B (plus the direct elicitations of B by A when different stimulus 
words are used) are divided by the total score of associations elicited by A. 
The fraction is multiplied by 100. This index is adapted to the use of continued 
association tasks and allows for the differences of responses emitted at various 
places in the individual sequence. It can be used to measure the relationship 
of any two words for a particular group (as well as to assess the relationship 
of any two groups in respect to any particular word). This index also reflects 
the difference in relationship depending on the direction of the comparison of 
the stimulus words A to B or B to A. The affinity of HUNGRY to POOR 
may differ from the affinity of POOR to HUNGRY. 

Table 4 gives the intracultural associative affinity indexes for the stimulus 
words in two reference spheres for the three groups, The matrix data show 
many distinct differences that are consistent. The affinity of the word RICE 
to the other food-related stimulus words is always higher for Korean and 
Colombian students. The relationship of HUNGRY to POOR is consistently 
higher for the foreign groups. This is also true for the other poverty-related 
terms like beggar and money. They support Kluckhohn’s (24) position that 
hunger represents different needs and has different meanings for people of 
other cultural backgrounds. The data display considerable consistency, espe 
cially in culture-specific items that are closely related in meaning (e.g., POOR, 
BEGGAR). 

The overall amount of commonality between group response lists may be 
expressed by a different version of the associative affinity index, the intercultural 
or intergroup index (43). This index shows the amount of associations shared 
by two groups and may be used to express the affinity or similarity of responses 
obtained from different groups to the same stimulus word, The index shows 
a high similarity on words like SCHOOL and FAMILY, which student 
groups may be expected to have in common, and a low similarity on culture 
specific items like REVENGE, OFFENSE, ANCESTORS. The intergrouP 
affinity of the word HUNGRY for the three cultural groups is in between- 
Since culturally different groups are compared, group specificity may be inte! 
preted as culture specificity, 


2. Agreement with Cultural Knowledge 


The inventories obtained from different groups, as previously discussed, show 
many single items that are characteristic for the particular cultural group; 
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the responses receiving the highest scores usually differentiate the groups clearly 
(see Tables 1 and 2). More general conclusions about differences betwees 
groups can be derived by comparing categories of responses. The categoria 
of responses to the stimulus HUNGRY, for instance, show that the United 
States group scored highest on food in general; the Korean group, on rie, 
on poverty, and on people in general; and the Colombian group, on driv 
states, poverty, negative feelings, and meals in general (see Table 3). 

There is certainly nothing new in finding that rice represents the most im 
portant food for Koreans or that meat is the most popular food category for 
Americans. However, in the present investigation, it is not the novelty of th 
finding that is important, but rather the correspondence between the associ 
tive data presented and well-known cultural facts. The validation obtained 
on easily verified concrete stimuli provides a basis for expecting similar finding 
for the abstract stimuli, such as DEMOCRACY and CAPITALISM. 

Results on four family-related words of medium level of abstraction will 
be discussed next. As the following data suggest, they represent words of var- 
ing culture specificity (see Table 5). 

The responses to the stimulus ANCESTORS show extensive differences ій 
main component categories, in agreement with well-established cultural knowl 
edge. The United States group gives the largest number of references to distant 
relatives and foreign nations, indicating that for this group ANCESTORS 
primarily means a distant relative of foreign origin, References to conditions 
of prehistoric life (caves), animals (ape), reveal the influence of evolutionary 
theory. 

The Colombian group gives the largest number of references to ancient 
times, to genealogy, and to historic occasions, suggesting that for the Colombiat 
group, ANCESTORS refers primarily to national and family history. Th 
importance of family background, history, genealogy are broadly acknowledged 
characteristics of the Spanish-Colombian culture (28, 37). 

The Koreans list relatively few references to the above categories which, 
however, seem important to the United States and Colombian groups. Rather: 
the Koreans concentrate heavily on death, funeral ceremonies, and religious 
and mythical words, suggesting that feelings of worship and devotion am 
important. The Korean responses show ANCESTORS as mythical perso% 
of supernatural character with nearly divine status. These findings are СОЁ 
sistent with the high incidence of ancestor worship in the Korean society 
7, 22, 34). 

Certain content categories are directly informative, Responses in the various 


food categories elicited by HUNGRY and TO EAT, for example, refet 
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food preferences that are the main items of the group's diet. The frequent 
mention of foreign countries elicited by ANCESTORS reflects the relative 
importance of these countries as main sources of immigration. 

Other content categories are less direct in their indications. The frequently 
used "people in reference to . . ." category indicates the personal, humas 
character of an issue. A centering of responses on such categories as foreign 
countries and foreign geographic locations shows the distant origin or remote 
character of an issue. High response frequencies in such content categories # 
references to pain, sickness, and negative feelings indicate a negative attitude, 
just as those in categories of fun and entertainment indicate positive evaluation. 

The same content-analytic method, when applied to the responses of tht 
word FAMILY, shows less dramatic differences. In order to simplify the 
comparison, the following presentation of the results will be limited solely t 
the United States and Korean groups. Semantographs show the main semantit 
components as segments presented graphically. The length of the segments is 
proportional to the category score of the groups. The Korean group produced 
fewer responses than did the United States group, probably because theif 
writing takes longer. The amount of this difference (14 per cent on a bast 
of 36 stimuli) was added to the Korean category scores in Figures 2-4. 

Despite this correction, the United States group scored higher on all but 
one component of FAMILY. No direct evidence accounts for this difference, 
which, according to Noble's measure (33), indicates a greater concern with 
FAMILY by the United States group. 

While the proportions of most components are similar for Korean and 
United States groups, the United States groups show a greater concern with 
“shared items and common events” and “home and related equipment”—with 
many references to home, car, house, gatherings, fun, dinner: that is, item 
and events belonging to the family scene. Such findings are largely supported 
by the observations of authors (29, 46) describing American middle-class 
values and changes in the way of life in the modern age of “kitchen revolution” 
and “appliance revolution.” 

Furthermore, in making extensive references to love, closeness, togetherne 
friendship, and warmth, the United States group reveals greater concern wi 
the emotional-relational aspect of family. This is in agreement with observ* 
tions emphasizing affection and functionally important interpersonal relation’ 
as the main cohesive force of the United States family (38). The categori 
composition of FAMILY—especially in respect to the personal components— 
shows general similarity between United States and Korean groups. 
United States emphasis on home, equipment, event-entertainment, and em 
tional-relational aspects emerges as agreeing with cultural observations. 
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Although the United States graph for FAMILY was more inclusive than 
the Korean, the semantograph of FILIAL suggests that this word has a richer 
content for the Koreans. Also, the main components of this word differ greatly. 
For the Korean group, the primary emphasis is on "children"; on “parents” ; 

LI 


RELATIONSHIPS, 
FEELINGS AND EMOTIONS 
246 


SHARED ITEMS 
AND COMMON EVENTS 


HOME AND 
RELATED 


EQUIPMENT 159 


MISCELLANEOUS 135 g RELATIVES AND 
OTHER PERSONS 
117 
REFERENCES SIZE AND BACKGROUND 
INDICATING DISCORD ANAS, 
a BIOLOGICAL REFERENCES 
05=35 K-3 


—— 


US POLYGON BASED ON A SCORE OF 1512 
KOREAN POLYGON BASED ON A SCORE OF 1098 (CORRECTED) [xx d 


FIGURE 2 $s 
CATEGORY COMPOSITION OF FAMILY For THE UNITED STATES 
AND KOREAN GROUPS 


X Mics 
on “respect, love, and other feelings”; and on “qualities and characteristics 


of the filial person. The sizeable components derived from references to “myth 
and religion” and “service and duty” have practically no United States equiva- 
lents. Also, the "education, moral” component is little shared by the United 
States group. The central position of FILIAL in the culturally characteristic 


focused on the filial relationship of children to parents and which implied 
service and duty—explains much of these results (16, 20, 27, 31). 
The last word used in the reference sphere of family is TO RESPECT. 


CHILDREN 
US=34 
219 


RESPECT, LOVE, 
OTHER FEELINGS 
153 PARENTS 
133 
QUALITIES AND 
CHARACTERISTICS 
144 


FAMILY, RELATIVES 


MYTH AND 
126 RELIGION 
us=0 


SERVICE, DUTY |05 
Us=3 


EDUCATION, MORAL 90 104 


68 57 MISCELLANEOUS 
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Korean education process, as supported by religious mythical elements—which 
US=26 


8 
OTHER PEOPLE SOCIAL, PSYCH. CONCEPTS 
US=52 K=46 
ASSOCIATIONS, 
POLITICAL, BUSINESS 
=33 
US POLYGON BASED ON A SCORE OF 770 —— 
KOREAN POLYGON BASED ON A SCORE OF 1256 (CORRECTED) pir x] 
FIGURE 3 


CATEGORY COMPOSITION or FILIAL FOR THE UNITED STATES 
AND KOREAN GROUPS 
"The semantograph suggests that the Koreans show large components dealing 
with various categories of people— "people, persons in general"; “educators, 
teachers" ; and “family, parents.” The marked cultural emphasis of Koreans В 
well documented оп RESPECT—with special regard to parents, elders, 
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teachers (5, 16, 23, 34). In contrast, the United States group emphasizes 
“qualities, human values.” This suggests that for the United States group 
RESPECT is less attributed to peoples and roles in general than to specific 
qualities and values. 


QUALITIES, HUMAN VALUES, SIMILAR MEANING, 
260 SYNONYMOUS EXPRESSIONS 


ACTIONS, PEOPLE, 
CONSEQUENCES, 59 PERSONS 
06-85 IN GENERAL 
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CATEGORY COMPOSITION OF TO RESPECT FOR THE Unrrep STATES 
AND KOREAN GROUPS 


'The interpretation and the insights offered by certain portions of the 


associative data may require substantially more detailed attention and docu- 


mentation, depending on the objective of the investigations. Our present con- 


cern was limited to showing the basic components of four family-related words 


and exploring their agreement with well-known cultural facts. 
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3. The Validity of Associative Inferences on Cultural Meanings 


In order to obtain additional experimental evidence on the relationship 
between associative inferences and cultural meanings, two approaches were 
selected using different criterion measures, The first approach was designed 
to explore how validly associative data reflect cultural meaning in general. 
The second explored the validity of associative inferences on a selected aspect 
of cultural meanings, attitudes, 


4, Inferences on Meaning Similarity 


The relationship of words expressed by intracultural associative affinity 
indexes was studied as a measure reflecting cultural meanings of meaning 
similarity, The literature shows the usefulness of similarity judgment tasks 
as criterion measures of meaning (2, 13, 42). 

Similarity judgments include both connotative and denotative meanings, 
and, as Flavell says, they provide a “natural and direct way” to approach 
meaning providing information on the personal perception of similarity. Simi- 
larity judgments express the relationship of words on the basis of their 
meanings as perceived by the subjects. 

The intracultural index of associative affinity (see Table 4) expresses the 
relationship of words based on the portion of associations they elicit in common. 

A comparison of the criterion measure reflecting meaning with the intra- 
cultural indexes reveals the extent to which associative similarity reflects. 
meaning similarity or meaning content. 

As a third measure expressing the relationship of words on the basis of 
their connotative similarity, Semantic Differential profile distance data wert 
included in the comparison. 

Pearson's product-moment formula (17) was used, and correlation coeffi 
cients were obtained on the relationships of associative affinity indexes, simi- 
larity judgment scores, and Semantic Differential distance scores on a large 
sample of word pairs (232-242) for each cultural group (see Table 6). 

The generally high positive correlations between associative affinity and 
similarity judgment show a close relationship between associative inferences 
and the criterion measure for meaning similarity. They are in basic agreement 
with data reported by Forster, Triandis, and Osgood (14) on the correlation 
of analogous associative overlap, Semantic Differential, and triadic similarity 
judgment on United States subjects. The high positive correlation suggests 
that the amount of overlapping associations reflects the degree of similarity 
or overlapping meaning. This finding supports the theoretical model of 
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TABLE 4 
THE INTERCORRELATION OF ASSOCIATIVE AFFINITY INDEXE, StMiLARITY Јооомахт 
SCORES, AND SEMANTIC DIFFERENTIAL PROFILE DurAwcES row тик 
Tuere CuLruraL GROUP 


Measures States Korean Colombian 
Semantic Differential-Associative Affinity 22 25 M 
Semantic Differential-Similarity Judgment 25 25 49 
Similarity Judgment-Associative Affinity 56 72 ЕД 
N (number of word pairs studied) 232 232 242 


.N (number of word pairs оа) ———_ 


meaning-mediated association process by suggesting that the greater associative 
overlap means more meaning elements in common. 


S. Inferences on the A ttitudinal-Evaluative Meaning Component 


Free verbal associations may be expected to offer valid information on any 
particular segment or component of the total meaning to the extent that they 
provide valid information on cultural meaning as a whole. Cultural attitudes 
were selected as a meaning component of special sociopsychological interest. To 
test whether the approach can measure the attitudinal-evaluative component, 
it was necessary to isolate attitudinal meaning components within the list and 
to translate them into attitude scores. These scores may then be compared with 
attitude data obtained from an accepted attitude scale. 

Attitudes are conceived as the products of personal evaluation, usually com- 
posed of positive and negative elements. The sum of positive and negative 
elements defines whether someone holds a positive or negative attitude. Simi- 
larly, group attitudes may be conceived as resulting from the overall summation 
of positive and negative value-laden meaning elements characteristic for the 
group members. Both direction and intensity of the attitude depend on the 
dominance of either positive or negative elements. The degree of co-occurrence 
of positive and negative meaning clements can be taken asa sign of ambivalence. 

This rationale, when applied to free verbal association, suggests a practical 
method for assessing attitudes. When responses with positive value content 
are identified as positive attitudinal elements, and responses with negative 
tive attitudinal elements, free verbal asso- 


value content are identified as nega E l 
ciations may provide information for the empirical assessment of attitudes. 


This objective requires the identification of positive or negative value-laden 
responses, their respective summation, and the expression of their relative 


dominance. ч 
* 
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As a first step, the responses of the groups were divided into positive and 
negative value-laden categories; the nonevaluative responses were discarded. 
Attitudinal indexes were calculated by the formula: 

positive valuation — 24 total valuation 


Attitude Index — X 100 
16 total valuation 


Where: Positive valuation is the sum of scores achieved by responses with 
positive value loading, and 


Total valuation is the sum of scores achieved by responses with 
positive and negative value loading. 


"This attitude index ranges between plus or minus 100. Its sign indicates 
the direction of the attitude and its numerical value reflects intensity. With 
this formula, two sets of attitude scores were calculated: the first used all 
responses and the second (to simplify the method) used only adjectives. 

To test the validity of these attitude scores derived from associations, the 
Semantic Differential was used as a criterion measure. The Semantic Differ- 
ential reported by Osgood and his associates (36) provides valid measures on 
attitudes. Attitude scores are generated averaging the scores on the evaluative 
scales (see Table 7). 


TABLE 7 
THE CORRELATION OF ATTITUDE INDEXES WITH SEMANTIC DIFFERENTIAL 
EvALUATION SCORES 


All value-laden . Evaluative 
Group responses (r) adjectives (r) N 
United States .93 .86 13 
Korean 96 .88 13 
Colombian .86 .81 13 


The high positive correlations of the attitude indexes with the Semantic 
Differential attitude scores support the assumption that associative group 
responses supply valid information on attitudes as elements of the total mean- 
ing content. 

"The results based on the use of adjectives are slightly inferior to the results 
based on the use of all value- or affect-laden responses. The correlations are 
still high enough, however, to indicate that the approach, when limited to the 
use of adjective responses, can be used as a fast method. 


F. POTENTIAL APPLICATIONS 


At least three broad fields of application of this technique of measuring 
cultural meaning may be noted. 
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The most important is in communication—that category of human behavior 
dominated by words and meanings. Critical differences in meanings between 
cultural, subcultural, or other defined groups may be identified; causes of 
communication disfunctions traced; and methods developed for selecting word 
substitutes or building meanings systematically on already existing elements. 

A second broad category of application is in analytic study of culture. As 
characteristic differences of the investigator's own culture may be easily iden- 
tified, Associative Group Analysis (AGA) data can be adapted to training for 
functioning in a foreign culture: i.e., to the teaching of culture, a practical 
objective emphatically enunciated by Hall (19). 

Potential intrasocietal applications are many, and results are already being 
obtained. Brent and Adams studied differences between pro-Johnson and pro- 
Goldwater college students (3). Carrol and Stark (6) studied the group 
meanings of delinquent and nondelinquent adolescents. 

In a third area of application, AGA may be used for the study of an individ- 
ual in relation to a group by measuring the degree of conformity of the indi- 
vidual to a group-specific associative response pattern. Since groups of any 
special research interest (e.g. schizophrenics, delinquents, immigrants) may 
be selected and appropriate response norms developed, a broad variety of 
diagnostic, rehabilitative, and acculturation studies could be anticipated. 

A series of critical problems (e.g. translation and equivalent stimulation, 
the influence of language, statistical significance) which could not be discussed 
here due to space limitations are elaborated elsewhere (43, 44). 


С. SUMMARY 


A comparative study of association material from three cultural groups 
indicates that free verbal association is useful in the quantitative analysis of 
group-specific meaning content, providing a new research technique called 
Associative Group Analysis. Continued free verbal association tasks are group 
administered in written form, and stimuli selected from the area of research 
interest are used. Intragroup and intergroup indices measure word relatedness, 
while group response lists reflect group-specific priority orders and give basic 
Meaning components through content analysis. The results are in substantial 
agreement with cultural data and show high positive correlation with two 
selected criterion measures. AGA has primary relevance for intercultural 
Communication efforts where group-specific word meanings are of paramount 
importance, 

REFERENCES 


l. Attrorr, F, H. Theories of Perception and the Concept of Structure. New York: 
Wiley, 1955. 


186 


17. 


24. 


26. 


JOURNAL OF SOCIAL PSYCHOLOGY 


Basmian, J. R. Response chaining in verbal transfer. Technical Report 13, Depart 
ment of Psychology, University of Minnesota, Minneapolis, 1956. 

Brent, J. Е., & Арлмз, M. An associative group analysis of frames of reference 
characterizing pro-Goldwater and pro-Johnson college students. Unpublished 
manuscript, The American University, Washington, District of Columbia. 

Brony, N. Anxiety and the variability of word associates. J. Abn. & Soc. Psychol, 
1964, 68, 331-334. 


Brunner, E. Rural Korea: A preliminary survey of economic, social and religious 
conditions. In The Christian Mission in Relation to Rural Problems, 1928, 
6(No. 2), pp. 73. 

Carrot, J. C., & Starx, R. A. Comparative analysis of frames of reference be 
tween groups representing societally approved and delinquent value systems. 
Unpublished Master's research paper, American University, Washington, Dis- 
trict of Columbia, 1965. 

Сілак, C. А. Religions of Old Korea. New York: Revell, 1932. 

Corsa, C. N. Associative commonality and rated similarity of certain words from 
Haagen's list. Psychol, Rep., 1957, 3, 603-606. 

- Comparison of word associations obtained by the methods of discrete 
single word and continued association. Psychol. Rep., 1958, 4, 507-510. 

Conex, B. H., BovserieLo, W. A. & WhirTEMARSH, С. A, Cultural norms for 
verbal items in 43 categories. Technical Report 22, Department of Psychology, 
University of Connecticut, Storrs, 1957. 

Derse, J. On the structure of associative meaning. Psychol. Rev., 1962, 09, 161-175. 

FLAVELL, J. Н. Meaning and meaning similarity: Theoretical reassessixcnt, J. Сел, 
Psychol., 1961, 64, 207-319. 

FLAVELL, J. H., & Fravett, E. R. One determinant of judged semantic and asso- 
ciative connection between words. J. Exper. Psychol., 1959, 68, 159-165. 

Forster, К. I, TRiANDIS, Н. C., & Оѕсоор, C. E. An analysis of method in research 
on the measurement of meaning. Technical Report 17, Department of Psychology, 
University of Illinois, Urbana, 1964. 

Ganskor, B. E, & Houston, J. P. Measurement of verbal relatedness. Psychol. 
Rev., 1963, 70, 277-288. 

Grirris, W. E. Korea: The Hermit Nation. New York: Scribner’s, 1882. 

GuiLFORb, Y. P. Fundamental Statistics in Psychology and Education. New York: 
McGraw-Hill, 1956. 

Haacen, C. Н. Synonymity, vividness, familiarity and associative value rating of 
400 pairs of common adjectives, J. of Psychol., 1949, 27, 453-465. 

Hatt, E. T. The Silent Language. New York: Doubleday, 1959. 

Hewes, С. W., & Km, C. H. Korean kinship behavior and structure. Unpublished 
manuscript, Human Relations Area Files; paper read at American Anthropologi- 
cal Association, Berkeley, California, 1950. 

JopELET, F. Vocabulaire et structure des associations de mots en chaine orales 6 
ecrites, Annee Psychol., 1960, 60, 349-370. 

Kane, Y. The Grass Roof. New York: Scribner's, 1931. 

Km, A. C. Village life in Korea. J. Geography, 1937, 36, 315-320. 

Кі0скнонх, C. Mirror for Man. New York: McGraw-Hill, 1949. 

LAFFAL, I. & FELDMAN, S. The structure of single word and continuous word 
associations, J. Verb, Learn. @ Verb. Behav., 1962, 1, 54-61. 

MEUS The structure of free speech. J. Verb. Learn. & Verb. Behav., 1963, 9 


Lez, H. K. Land Utilization and Rural Economy in Korea. Chicago: Univ- 
Chicago Press, 1936. 


LORAND В. SZALAY AND JACK Е. BRENT 187 


D Lacrrxs, L. Н, et al. US. Army Handbook for Colombia. Washington, D.C.: 
М Foreign Area Studies Division, SORO, The American University, 1944. 
B. Lrrxer, M. America as a Civilization. New York: Simon & Schuster, 1957. 
| Marsuatt, С. R., & Corer, C. N. Associative indices as measures of word-related- 
" "mess: A summary and comparison of ten methods. J. Ferb. Learn, & Verb. Behoe., 
1963, 1, 408-421. í 
П. Moose, J. R. Village Life in Korea, Nashville: M. E. Church, 1911. 
- Ку, О. H. The psychologist looks at language. Amer. Prychologist, 1954, 9, 
3& Noster, C. E. An analysis of meaning. Psychol. Reo., 1952, 54, 421-430. 
И. Oscoop, CuanLEs E. Meaning cannot be ra? J. Ferb, Learn. О Verb’. Behav, 
С 1966, 5, 402-407. 
Oscoon, Снлкикѕ E., SUCI, G, & TANNENBAUM, P. H. The Measurement of 
Meaning. Urbana, Illinois: Univ. Illinois Press, 1957. 
6. Oscoop, ConNzLIUs. The Koreans and Their Culture. New York: Ronald Press, 
1951. 
Reorieiy, M. The American family: Consensus and freedom. Amer, J. Seciel., 


1946, 52(No. 3), 175. 

$& REICHEL-DOLMATOFF, G., & ReiCHEL-DOLMATOFF, A. The People of Aritama; The 

oo pr Personality of a Colombian Mestizo Village. Chicago: Univ. Chicago 
ess, 1961. 

L Russert, W. A., & Jenxins, J. J. The complete Minnesota norms for to 

100 words from the Kent-Rosanoff чаб association test. rr eal эон" 2, 

Department of Psychology, University of Minnesota, Minneapolis, 1954. 


0. Sarason, I. С. Relations of measures of anxiety and experimental instructions 
word association test performance. J. Abn. & Soc. Psychol., 1959, 89, 37-42. 
& Kors, D. Task attitudes in word associations, pro- 


© jection and non-projection. J. Abn. & Soc. Psychol, 1955, 33, 441-447. 
42. Szalay, L. B. The semantic structure of verbs. Z. Exp. Angew. Psychol., 1962, °, 


140-163. 

. Cultural meanings and values: A method of empirical assessment. 
Technical Report, The pecie University, Special Operations Research Office, 
Washington, District of Columbia, s E oe adio 
Szatay, L. В, & WixpLE, C. The relative influence versus 
factors on free verbal associations. Faper Dew at Annual Meeting of the 
American Psychological Association, New York, New York, 1966. ; 
Ware, R. K. Socialism and capitalism: An international misunderstanding. 
Foreign Affairs, 1966, 44, 216-228. 
Wuutams, R. M., Jr. American Society: 
Knopf, 1952. 


iter for Research in Social Systems 
American University 

0 Wisconsin Avenue, №. W. 
ishington, D.C. 20016 


AL. Sueora, E., WALKER, N. W., 


A Sociological Interpretation, New York: 


| 


The Journal of Social Psychology, 1967, 72, 189-195. 


SOCIAL EPITHET IN INDIA: A HINDI EXAMPLE* 


Department of Sociology, Indiana University 


ALLEN D. GniMsHAW! 


A. INTRODUCTION 


Studies of stereotypes have generally consisted of frequency distributions 
of respondents’ agreements on listings of discrete negative evaluations of per- 
sons belonging to social categories. Attention to positive attributes has been 
minimal and usually residual; attempts to identify underlying structures that 
integrate the variety of negative choices into core characterizations have not 
generally appeared in the literature. Given cultural and personality propen- 
sities for integration, it seems most unlikely that the assignment of negative 
and positive attributes to social categories is done in a nonsystematic manner. 

Most studies of stereotypes have examined either attitudes or beliefs about 
minority ethnic groups within nations or have attempted to identify cross- 
national stereotypes. India, as a nation state, is constituted of a number of 
sharply varying cultural and linguistic subunits. In a pioneering study of com- 
munication in India, Bower and his colleagues remarked: 


In a sense, the relations of one provincial group in India with another 
may almost be thought of in a context of international, or at least inter- 
cultural, relations, The fact that different regional groups speak different 
languages, have their distinct caste. variations, have their own cultural 
traditions, has meant that to many Indians a man from another province 
can seem almost as foreign as a man from another country. 'The common 
features of religion and government (and among the better-educated, 
of English language) cannot entirely eliminate a regional ethnocentrism 


(1, p. 47). 


The widespread use of regional and other stereotypes is a feature of Indian 
life obvious to even the most casual visitor. A scientific examination of epithet 


in India can provide both an illumination of an important variety of social 
—— 
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behavior and, at the same time, perspectives on sources of social strain and 
group conflict which will be of usefulness to social planners. 


B. PURPOSE 


The purposes of this study were (a) the testing of the widely held assumption 
that there are differentiated characterizations of people native to the several 
regions of India, (Б) the identification of such characterizations if they existed, 
(c) the determination of whether single items might accurately identify 
attitudes toward social categories or whether, instead, characterizations con- 
sist of sets of interrelated attributes, and (d) the determination of the useful- 
ness of a modification of the semantic differential (4) and of factor analysis in 
research on social epithet. Only Hindi epithets were studied; it was not a 
purpose of this research to examine reciprocal sets of characterizations. 


C. METHOD 
1. Subjects 


The subjects were 25 male Indian nationals enrolled as graduate students 
at Indiana and Purdue Universities. Only students who had Hindi as а 
“mother tongue” participated; the bulk of the subjects came from Uttar 
Pradesh, in the heart of the Hindi-speaking area. 


2. Materials? 


The instrument used resembles Osgood’s “semantic differential,” but differs 
sharply in what it attempts to measure, Osgood and his colleagues list several 
criteria for the use of linguistic encoding as an index of meaning: 


(а) a carefully devised sample of alternative verbal responses which can 
be standardized across subjects, (b) these alternatives to be elicited from 
subjects rather than emitted so that encoding fluency is eliminated as a 
variable, and (c) these alternatives to be representative of the major 
ways in which meanings vary [(4, р, 19) emphasis in the original]. 


The instrument used has as its alternative responses 40 negative epithets and 
their closest antonyms separated by seven-point scales. Thus Osgood is at 


2 Material supplementary to this article, including rough English equivalents (0 
terms used in the instrument and variable numbers, means, names, and loadings, by 
regional label, has been deposited as Document number 9400 with the ADI Auxiliary 
Publications Project, Photoduplication Service, Library of Congress, Washington 
D.C, 20540. A copy may be secured by citing the Document number and by remitting 
$2.50 for photoprints, or $1.75 for 35-mm microfilm. Advance payment is requiret 
Make checks or money orders payable to: Chief, Photoduplication Service, Library 


of Congress, Additional materials, including the Original schedules, may be obtained 
direct from the author. 
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tempting to isolate the meaning of words, whereas we are attempting to iden- 
tify characteristic ways of perceiving members of significant social categories. 

The terms used were administered only in Hindi, and were initially selected 
from widely read children’s stories with supplementary terms added to make 
the listing consonant with an international relations conception of the meaning 
of stereotype. Considerable difficulty was encountered in finding positive epi- 
thets with the fine shades of meaning attached to the negative terms.’ In 
several instances there are no English equivalents that can convey the full 
richness of the Hindi epithet. The 40 pairs of terms were repeated for each of 
10 regional designations (Gujerati, Rajasthani, Punjabi, Delhiwallah, UP- 
wallah, Bihari, Bengali, Madhya Pradeshi, Madrasi, Maharashtrian), making 
400 judgments for each respondent. 


3. Procedures 


1. The respondents were not asked to identify their home region because 
it was felt that they might resent an American prying into things that con- 
cerned Indians or might in their sophistication deny that stereotyping existed 
in India, The project was thoroughly explained by the investigator and an 
Indian assistant. Subjects were instructed to give not their own reactions but 
those which they believed to be typical in their home areas Apparently these 
precautions were not necessary. All subjects fell to the task with great en- 
thusiasm—in some instances giving gratuitous examples to prove the truth of 
the stereotype—and the results obtained give overwhelming evidence, even 
with a limited N and with more response categories than respondents, of the 


existence in India of definite patterns of regional stereotype. Я 
2. Responses on each item were given arbitrary weights of one throug 
seven,® and means were calculated for each item for each region. These scores 


were used in factor analyses of each of the several regions. 
__ 
imi i i ber of 
8 Preliminary inquiry reveals that a similar disproportionately large num 
Negative, as Lie A positive, terms is characteristic of Yiddish, German, French, 


Spanish, Russi Bulgarian (as well as English), and may possibly be common 
in all Таша d existente i such imbalance, if it is indeed a fact, might help 
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in explanation of a wide variety of thought and behavi е i" ic 
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D. Resuits AND Discussion 


Tn any set of 40 choices concerning individual attributes there is, inevi 
some redundancy. Moreover, people tend to perceive members of social cate 
gories as sharing clusters of interrelated characteristics: viz., the characteristics 
of pessimism, dishonesty, provincialism, courage, and progressivism will not 
frequently be assigned to the same individual or group. The purpose of factor 
analysis is to identify the one or several underlying factors to which a series 
of intercorrelated variables are related. Thus, through factor analysis an at 
tempt is made to explain the intercorrelation of several variables (here the 
scores on each item over all subjects against scores on another item over all 
subjects) by their varying strength of association with one or several common 
hypothetical factors (it is possible, of course, that the individual variables are 
not unidimensional but composed of several components, each of which may 
be related to a different factor). Through use of this method it is possible 
to make manageable the otherwise overwhelming task of analyzing the mean 
ing of an extraordinarily large number of variable by variable correlations. 

The proportions of total variance (within the set of 40 intercorrelated 
scores) explained by the first and subsequent factors—until 70 per cent had 
been explained or until increments were less than 5 per cent—were calculated 
for each of the 10 regions. While all those variables with loading of .50 oF 
more were listed for each factor, only those of .60 or more were included 
in the analysis. Variables for the several factors were independent in most 
instances; only two exceptions in one region were not totally independent at 
the .60 loading in the first two factors. 

The within-region combined means give a rough ranking of overall assess 
ments of persons from the several regions. This ranking, shown in Table ly 
is congruent with other studies and with what appear to be widespread eX- 
pressions of attitudes in North India [eg. (3)]. 

Many of the initial factors are composed of 10 or more variables and defy 
simple interpretation. Close scrutiny reveals, however, that several of the 
variables in each of these factors, as in the higher order factors, are closely 
related, that fairly cogent descriptions of the factors are possible, and that the 
factors are consonant with the complex characterizations of residents of the 
several regions, which are current in the Hindi-speaking areas of India. 

Only a sampling of these factors is reported here. Given the fact that all 
of the schedules were filled out by Hindi speakers and, therefore, presumably 
by North Indians, one is not surprised at the generally negative characterization 
of Bengalis and Madrasis reported in Table 1. In both cases, the initial factors 
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are composed of a collection of negative epithets, combined with a minimum 
of partly redeeming or neutral labels. Factor I for Bengalis (21.421 per 
cent of the variance), for example, characterizes people from this region as 
timid, devious, and crafty flatterers, given to empty threats and redeemed 
only by a marginal honesty and minimum generosity.* In addition, this group 
is seen (Factor III, 10.730 per cent of the variance) as provincial, cliquish, 
eccentric, and submissive. Madrasis are characterized (Factor I, 20.022 per 
cent of the variance) as provincial, selfish, garrulous, and cliquish people who 
combine simplicity with intelligence and also (Factor III, 12,882 per cent 
of the variance) as cowardly, unreliable frauds. Maharashtrians, on the other 


TABLE 1 
Compinep MAN $совко sy Rectox* 
Regional designation Combined mean 
Bengali 3.91 
Madrasi 3.93 
Punjabi 445 
Gujerati 451 
Rajasthani 4.56 
Delhiwallah 4.60 
Madhya Pradeshi 476 
Maharashtrian +4 
Bihari 4.85 
UPwallah 4.95 


а Since sampled attitudes are neither independent nor from probability samples, 
differences of meats tests are not applicable, It seems unlikely, however, with а range 
of from 1.0 to 7.0 and with only 25 respondents, that differences as great a» those 
reported can easily be attributed to chance. 


hand, are characterized (Factor I, 18.382 per cent of the variance) as bold, 
straightforward, hardworking, and reliable optimists. The factors for Punjabis 
reflect the apparent ambivalence that many Indians have about this colorful 
group. They are characterized at the same time (Factor I, 25.142 per cent of 
the variance) as innocent, good, generous, and honorable and as cliquish, 
fraudulent, and selfish. Factors II and III, respectively, represent other facets 
of the stereotype of this group when Punjabis are characterized in the first 
instance (Factor II, 14.828 of the variance) as quick to anger, rash, lawless, 
obstinate, and boastful and in the second (Factor III, 10.985 of the variance) 
as bold (6.46 and 6.54 on two closely related choices), straightforward (5.54), 
hardworking (6.23) men of their word (5.58). Examples could be multiplied 


6 There is surely no need to emphasize ће proximate nature of A dare ry ign 


lations, Shades of semantic differentiation in Hindi appear to 
ose in the closest English approximations. 
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but one further instance must suffice. Gujeratis are characterized (Factor I, 
27.340 per cent of the variance) as honest, just, quiet, and kind; but, at the 
same time, as selfish and unreliable in crises. Somewhat more consistent is 
Factor II (13.334 per cent of the variance) in which they are labelled as 
cowardly flatterers, given to empty threats. 

Respondents for this study did not constitute a random sample and might 
have been expected, because of their advanced educational standing and self- 
identifications as “ambassadors,” to resist making negative characterizations of 
whole categories of their countrymen. The success of the research in delin- 
eating factors of clustered epithets in spite of the small sample and of defects in 
the instrument underlines the potential value of the use of a modified semantic 
differential and of factor analytic techniques in research on the complex prob- 
lems of intergroup and interregional attitudes and behaviors. 


E. SUMMARY 


The purpose of this study was to test the utility of a modified semantic 
differential and of factor analysis in the investigation of interregional imagery 
in a nonrandom sample of male, Hindi-speaking Indian nationals temporarily 
resident in the United States. The principal findings were as follows: 


1. Mean judgments over 40 positive-negative items produced a ranking of 
regions consistent with findings of previous research and with generally re- 
ported characterizations. 

2. Factors for individual regions are complex, indicating that stereotypes 
are multidimensional. Factors were, nonetheless, consistent with previous 
research in their general characterizations of residents of the several Indian 
regions identified in the instruments. 

3. There is some evidence that, while languages may vary in ratios of posi- 
tive to negative epithet, the latter constitute a disproportionate part of the 
total lexical inventory in all languages. 
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A. 1. Rapin AND Joserina А. LiMUACO 


A. INTRODUCTION 


Cultural anthropologists and ethnographers have recognized, long ago, the 
difficulties in communication involved in cross-cultural research. Investigators 
have often turned away from the structured type of personality questionnaire, 
so frequently employed in personality studies in Western cultures, to the 
unstructured variety of methods known as projective techniques (1). Here, 
the linguistic input in the testing situation is usually at a minimum, but the 
output of the respondents to the unstructured stimuli is of utmost importance. 
The interpretive superstructure concerning personality rests heavily upon the 
telatively free verbalization of subjects in response to the nonverbal stimuli 
with which they are presented in the testing situation. у 
However, this substitution of nonverbal stimuli, which cannot be miscon- 
strued or distorted in their translated form, is not without its problems and 
complications. One of the basic questions that arises is the one of equivalence of 
stimuli: Do stimuli, even of the unstructured (and nonverbal) variety, have 
the same meaning for persons of different cultures? 

Of course, the issue of structure or its absence is not quite simple. All of the 
projective stimuli have some structure; the Thematic Apperception Test 
(TAT) cards, for example, which portray specific social situations involving 


persons in Western dress can be said to be more structured than the Rorschach 


inkblots which do not objectively portray specific items. We selected the latter 
cross-cultural 


method, specifically the 10 Rorschach stimulus cards, for close 
scrutiny. 
B. METHOD 

In a study of 66 college students in the United States, one of the еен 
authors (3) reported fairly clear connotative meanings of poss М in! m 
by means of the semantic differential (2). The semantic differential consts 
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of a series of scales which are defined by polar adjectives: i.e., adjectives that 
are opposite in meaning to each other. The poles are separated by seven spaces, 
one of which may be checked in order to indicate the position of the stimulus 
rated. The following is an example of such a scale: 


Good — — — Bad 


A 20-item semantic differential was checked by the Ss in response to each 
of the 10 Rorschach inkblots which were projected, one at a time, on a screen. 
Responses were dichotomized: i.e., those placed on either side of the midpoint 
of the scale. The obtained distributions were compared with the 50-50 chance 
distribution. Thus, if the vast majority of our subjects, for example, checked 
the spaces on the left side (see scale above) of the midpoint, in response to а 
specific inkblot, and the distribution is significantly different from the chance 
expectancy, the stimulus is considered as “good.” Conversely, if the weight of 
the actual distribution of responses is on the right and a significant chi square 
is obtained, then that particular Rorschach card is considered as "bad." How- 
ever, if an obtained distribution does not differ significantly from chance 
expectancy, the particular adjectives involved do not enter into the determin- 
ation of the connotative meaning of the Rorschach card under consideration. 

Of 200 comparisons (20 "semantic differential" scales X 10 Rorschach 
inkblots) involving the adjectival polarities of the semantic differential, 112 
differ from chance at the .05 level of confidence or better. Similar studies 
with other American samples, produced results in essential agreement with 
our original findings (4). 

The present study reports the responses of 43 Filipino college students to 
Rorschach's inkblots by means of the same selected 20 scales of the semantic 
differential. In addition, the results of the Filipino sample are compared with 
those obtained with the American students, reported in 1959 (3). The same 
statistical procedure has been followed; responses on the seven-point scales 
were dichotomized and the significance of the differences between the obtained 
distributions and the theoretical chance distribution, of 50 per cent on either 
end, was determined by means of the chi square statistic. 


C. RESULTS 
Of the 200 comparisons, 79 turned out to be significant at the .05 level of 
confidence or better. 
These results are reported in Table 1. It is interesting to note that 51 of tht 
79 significant differences were obtained on the same scales and to the same 
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Eacu PoLaR END IN JUDGING RORSCHACH CARDS 


Adjectives 


IV 


Rorschach cards 


VI 


Ex Ea Ban аганы) ыя 
SSRUISRFSRES Sen sere 


good 
large 
beautiful 
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pleasant 
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fast 
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TABLE 1 (continued) 


I 


II 


ш 


ІУ 
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Rorschach cards 
VI 


ҮП 


УШ 


ІХ 


x 


ot c once зә 


bad 

small 

ugly 

weak 
dirty 
distasteful 
worthless 
cruel 
unpleasant 
sad 
peaceful 

. light 


‚ hazy 


cold 

. thin 
dishonest 
. passive 
smooth 

. slow 
delicate 


++ 


** 


* Significant at less than .05. 
** Significant at less than .01. 
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cards as were reported on the American sample, referred to above. However, 
61 of the significant differences that resulted from the American data did not 
appear in the Filipino findings; also, 28 differences based on the Filipino 
results did not appear in the American data. Thus, it would seem that there 
is a good deal of commonality of meaning of the Rorschach stimuli for both 
cultural groups, but there is considerable divergence as well. It certainly 
cannot be stated that the inkblots have the same connotative meaning for 
different cultural groups, since the divergence is so marked. In only five 
instances, however, was there a reversal of polarities, Whereas, for instance, 
Card II was described by the Filipinos as clear," it was considered “hazy” by 
the Americans; Card VII is considered by the Filipinos as active" as compared 
to the “passive” adjective on which there was marked American agreement, 
etc, Generally, as is the case of the earlier study, no two cards were described 
in exactly the same terms, although some overlapping in the selected adjectival 
polarities does occur. 

Another set of comparisons involves sex differences. As in the study with 
the American subjects, the data for the two sexes were also treated separately 
in the present instance. The Filipino sample contained 20 males and 23 females. 
A comparison of the number of significant chi squares obtained with both sexes 
in the two cultural groups appears in Table 2. For the American groups 67 
and 86 significant chi squares were reported for the males and females, 
respectively (112 for the combined sample). For the Filipinos 61 and 79 
significant chi squares were obtained for the males and females (79 for the 
combined sample). Detailed information is included in Table 2. 


D. Discussion 


Several additional observations are suggested by the data. First, the rank 
order of the cards with respect to the meaningfulness of the inkblots differs 
considerably for both samples (see Table 2). Whereas Card I is most mean 
ingful (ie., greatest number of agreements on meaning) for the Filipinos, it 
ranks toward the middle for the Americans; conversely, Cards IV and vil 
show greatest agreement in meaning for the Americans, but much less so for the 
Filipinos. The coefficient of correlation between the ranks obtained for the 
two samples is not significant statistically (rho = .48). 

The second point concerns sex differences. In the American sample, ther 
seems to be a high degree of congruence between the sexes with respect to the 
meaningfulness of the cards. Those cards that showed agreement on a larg? 
number of scales for the females also elicited similar reactions from the males. 
The rank differences correlation between the sexes as to the meaningfulness © 
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the different cards yielded a rho of .901. However, a similar correlation for 
the Filipino groups resulted in a coefficient of .01. Thus, the agreement between 
the sexes for the American sample is very high, whereas for the Filipino sample 
the agreement is not. It would appear that Filipino males and females differ 
markedly with respect to the extent of the connotative meaning of the visual 
stimuli to which they were exposed, while the degree of meaningfulness of the 
inkblots is very similar for males and females in the American culture. 

This differential trend between the sexes in the Filipino sample as compared 
with the American subjects finds its explanation in an earlier study (5) of the 
present authors, in which they noted marked differences between Filipino and 
American children with respect to sexual differentiation. These differences 
were related to contrasting child-rearing patterns of boys and girls in the two 
cultures. Apparently the differences in the early cultural experience affect modes 
of perception and association as reflected in the connotative meanings attributed 
to unstructured stimuli, such as Rorschach’s inkblots. 


E. SUMMARY AND CONCLUSIONS 


Forty-three Filipino college students, 20 males and 23 females, checked 20 
items of the semantic differential on a seven-point scale for the 10 Rorschach 
inkblots. The data were dichotomized, and the significance of the differences 
in the choice of each polar adjective was obtained by means of the chi square 
technique. For the combined group, 79 (out of 200) chi square values significant 
beyond the .05 level were obtained. The number of significant chi squares for 
the males and females, treated separately, were 61 and 79, respectively. ‘These 
data were compared with results obtained on an American sample, reported in 
a previous paper (3). The comparisons between the two cultural groups 
justify the following: 


1. In general, the inkblots were more consistently meaningful for the 
American students (112 significant chi squares compared with 79 for the 
Filipinos). 

2. Since only 51 of the chi squares overlap for both samples and 61 and 28 
appear exclusively in the American and Filipino results, respectively, it my 
be concluded that along with some congruity in meaning there is a wide area 
of divergence. 

3. There is a rather high congruity, in the relative connotative meaning of 
the cards, between the sexes in the American sample. No such congruity 
between the sexes was obtained for the Filipinos. 
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The results were discussed in the light of previous research, and related 
to the possible differences in the childhood experiences of the two sexes, in the 
two cultures involved. 

Finally, if the connotative meaning of Rorschach's inkblots differs from 
culture to culture (as seems to be the case in the present instance) and if the 
meaning has an effect upon the responses and associations produced in the 
testing situation, then the practicality and justifiability of the universal appli- 
cation of Rorschach interpretive procedures in cross-cultural research may be 
seriously questioned, 
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PERFORMANCE OF TANZANIAN SECONDARY SCHOOL 
PUPILS ON THE RAVEN STANDARD PROGRESSIVE 
MATRICES TEST** 


University College, University of East Africa, Dar es Salaam 


E. L. KLINGELHOFER 


A. INTRODUCTION 


Raven’s Standard Progressive Matrices Test (18)—henceforth PM—has 
been extensively used by individuals carrying out psychometric research on 
the African continent [cf. (6, 8)]. It has been administered under somewhat 
varying conditions in South Africa (16), West Africa (1, 15), the Congo 
(2, 3, 10, 12, 17), Zambia (11), Southern Rhodesia (5) and Uganda (19). 
Its popularity as a test probably stems from the fact that it is relatively 
simple to administer in a group setting, it represents an interesting set of 
tasks to the examinee, it is not tied to any particular language for purposes 
of administration and thus has utility in a variety of cultural settings, it has 
been shown to be a relatively pure measure of Spearman's general factor, and 
it has proved useful as a predictor of scholastic performance. 

To secure some descriptive information about secondary school pupils and 
to relate performance on the PM to other characteristics of the examinee— 
notably his sex, his age, his tribe (in the case of African students) or com- 
munity (in the case of Asians), and his educational Jevel in school—the test 
was administered to a large sample of secondary school students in "Tanzania. 
In addition the project provided some information bearing on Irvine's con- 
tention that 


As societies become more developed and industrialized, so their school 
systems become more homogeneous. Consequently (a) the effect of school 
quality on test performance will decrease in direct proportion to the in- 
crease in standards of teaching and school amenities and (b) the relation- 


ships of test scores to environmental factors will more resemble those of 
western societies (7, р. 30). 


* Received in the Editorial Office, Provincetown, Massachusetts, оп March 8, 1967, 
and given special consideration in accordance with our policy for cross-cultural re- 
search, Copyright, 1967, by The Journal Press. 

1 This report grows from а project supported by a grant from the Faculty Research 
Funds of the University College, Dar es Salaam, to which grateful acknowledgement 
18 given. 
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B. PROCEDURE 


During May and June, 1966, the PM was completed by 3692 students in 
31 secondary schools in mainland Tanzania. The tests were administered by 
education students at the University College, Dar es Salaam, who were 
practice teaching in the secondary schools and who had been given special 
training for the project. They gave the tests to students in the classes they 
were teaching so that some of the problems growing out of test administration 
by an expatriate were avoided [с]. (4, p. 460; 13)]. Two modifications were 
introduced into the ordinary routine of test administration. First, the directions 
were changed so that the examinee was given detailed coaching and help in 
the solution of the first five problems of Set A of the matrices. Our objective 
here was to try and insure that the examinee understood the test and the 
somewhat unfamiliar procedures for recording his responses on a separate 
answer sheet. Second, we imposed a time limit of 30 minutes on the test 
(after completion of the first five exercises) because it has been found that scores 
on the test given in this period of time (and under the conditions of familiariza- 
tion we used) were more variable and reliable than when given under a number 
of alternative arrangements (5, 19). The time limit was imposed for two 
additional reasons, however. First, with the time limit it was possible for the 
students to complete the test in one class period, and the disruption to school 
time tables was thus held to a minimum. Second, the imposition of a time 
limit (a nonstandard condition) precluded some of the kinds of comparisons 
that have marked the psychometric literature. 

After being administered the tests were scored and the individual score 
sheets were sorted by sex, age, form level, and by African tribe or Asian 
community.? Then the scores were subjected to a series of simple analyses of 
variance. In evaluating these analyses we adopted a 1 per cent coefficient 
of risk. 

It is appropriate to remark that the age data were supplied by the student 
and the problem of age determination is one of the more important obstacles 
to psychological research in Africa. The students reported their ages at tW? 
different times on two different forms, and these two figures were compared 
and adjudicated where this was necessary. It seems likely that the ages reported 
by the Asian groups are reasonably accurate, since most Asian families 216 
literate and follow the custom of observing birthdays so that a student could be 
expected to know and remember his date of birth. Among the African students 


2 Brief descriptions of the African tribes and Asian communities dealt with in this 
report have been filed with the American Documentation Institute. A map locating 
the tribes is also provided. See Footnote 3 in Klingelhofer (9) for ADI numbers. 
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this is not the case. A substantial number of the latter reported their age as 
“about 16” or “approximately 18” or “I think I was born in 1949," etc. The 
chances of getting accurate information are better if the child is born of 
literate parents or if he is born into a Christian family (since this will involve 
baptismal ceremonies and records), but a sizeable proportion of the African 
boys and girls in the sample come from non-Christian homes and the great 
majority of the African parents are illiterate. There is, in addition, the social 
fact that education is ardently sought by youth in developing countries. ‘The 
number of places in secondary school in these countries is extremely limited 
and, in Tanzania, eligibility for those places is determined by a competitive 
examination at the end of the primary school. The African student who fails 
the examination may go to another part of the country, live with relatives, 
re-enroll in upper primary school and retake the examination. In order to do 
this he is likely to change his name and adopt an age that is compatible with 
his educational level rather than his chronology, if it is assumed that he knew 
the latter in the first place. While this may seem somewhat untidy, the 
changing of names is a common practice associated with changes of status in 
the traditional society (from which most of the African secondary school stu- 
dents derive and which still plays a meaningful part in fixing some behavior 
and determining attitudes). Because of these factors we are inclined to think 
that the African ages, while accurate indications in the majority of instances, 
probably understate the true ages of the individuals to an unknown extent 
in the aggregate. 

Finally, it should be pointed out that the students were not randomly 
selected from all Tanzania secondary schools? However, the sample contains 
approximately one-fifth of all students enrolled in the first three forms of 
secondary school in the country, and all regions or the country are represented 
in the sample; so it is reasonable to assume that the results we have probably 


reflect what is true of the population as a whole. 


C. RESULTS 
1. Test Performance by Tribe or Community 


The various African tribal subgroups have had attributed to them various 
characteristics although we have argued elsewhere (9) that among secondary 
school pupils these differences, if they exist at all, are likely to be obscured 


EE oou . H 

3 The educational structure in Tanzania is English-inspired. Primary school embraces 
the first seven years of schooling (Standards I-VII). Secondary school consists of a 
Years of postprimary education; the first four years (Forms 1-4) culminate in the 
Cambridge ordinary level examinations. The candidate successful in these goes on to 
Forms 5-6, Our sample consists entirely of students in Forms 1-3. 
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through the influence of the rigorous selection process and the homogeneity 
of the experiences undergone by the students as students. Table 1 summarizes 
the results of simple analysis of variance tests applied to each Form level 
for each sex and ethnic group over the tribes or communities comprising the 
larger group.* 

It will be seen that only one of the 16 separate analyses of variance produced 
a value for F, which is significant at the 1 per cent level, this for Form 2 
African boys. A detailed analysis of the pairs of differences between means 
evaluated by means of f tests reveals that the significant F ratio is probably 
associated with an extremely high average level of performance for the 
Nyamwezi tribe and a low average level of performance for the Nyakyusa 
in the Form 2 sample. Of the 15 pairs of mean differences (out of the possible 
276) that were found to be significant at the 1 per cent level, 11 were asso- 
ciated with these two tribes. This finding will meet with discussion later. 

It is interesting to note that when the form levels are combined none of 
the groups (African or Asian, boys or girls) shows any differences in mean 
performance over subgroups (tribes or communities) that cannot be accounted 
for by chance. However, the composite means for the ethnic groups (e£ 
Form 1 African boys versus Form 1 Asian boys) show marked differences. 
When the sex and form equivalents of the ethnic groups are compared, there 
are significant differences in mean level of intelligence (as this is reflected in 
performance on the PM) in favor of the Asian groups at all levels and for 
both sexes. 


2. Test Performance by Form 


In a second series of analyses, each subgroup (tribe or community) Wa 
studied to determine if there was any trend in mean scores from Form 139 
Form 3. The results of this analysis were largely negative in the sense that 
significant F ratios were found only for Nyamwezi and Hindu boys. The first 
of these findings was alluded to earlier. The Nyamwezi showed a markedly 
high mean in Form 2. Among the Hindu boys there seems to be a high mean 
score for Form 3 students. 

When the mean PM scores for the sexes are compared within form levels 
over ethnic groups (e.g., African Form 1 boys versus African Form 1 girls 


4 Detailed tables reporting Ns, means, and standard deviations for the tribes g 
communities, forms, and ages by sex and ethnic groups have been deposited with t 
American Documentation Institute as Document number 9404, A copy may be secure 
by citing the Document number and by remitting $2.50 for photoprints, or $175 Бу 
35-mm microfilm, Advance payment is required. Make checks or money orders pay? 
to: Chief, Photoduplication Service, Library of Congress, Washington, D.C, 20540. 
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TABLE 1 
SUMMARY OF ANALYSES OF VARIANCE OF RAVEN STANDARD PROGRESSIVE Matrices Scores 
or TANZANIAN SECONDARY SCHOOL STUDENTS GROUPED ACCORDING ТО ScHOOL 
Form LEVEL, SEX, AND AFRICAN TRIBE OR ASIAN Community 


Degrees of 

Group, form, Sum oF freedom Mean square 

and sex N x squares Tribe Error Tribe Error F 
African boys, 

Form 1 930 34.1 94160 22* 906 133.6 100.7 1.33 
African boys, 

Form 2 946 34.8 90553 23 922 217.0 92.8 239 
African boys, 

Form 3 249 33.3 26609 20* 227 49.0 112.9 AS 
African boys, 

combined 2125 34.3 211934 23 2101 173.5 99.0 175 
African girls, 

Form 1 i 430 34.1 45926 11 418 175.1 105.3 1.66 
African girls, 

Form 2 370 33.8 39385 11 358 141 10.6 1.33 
African girls 

Eos $ v 38 38.0 2154  3* 34 255 76 1.33 
African girls 

combined і 838 34.1 88276 11 826 129.3 105.2 1,23 
Asian boy: 

Form т” 201 43.1 11248 3 197 86.1 $5.8 1.54 
Asian b 

Form 7a 148 44.2 7424 3 144 23.4 51.1 A6 
Asi 

dus Ж 66 454 52722 3 &2 1718 767  22* 
Asian boys, 

combine 415 43.9 24187 3 411 1.0 58.8 02 
Asi ; 

DR. 129 420 10259 3 125 194 774 2.52 
Asi i 

DU 140 414 128221 3 136 1328 — 919. 145 
Asi i 

pon 45 41.9 3991 3 41 3.3 94.7 03 
Asi i 

Ru 314 41.7 27098 3 310 159.7 86.1 1.85 


* Indicates that one or more of the subgroups (tribe or community) had one or 


no representatives for the form and sex analyzed. i 
** Indicates F to be significant at the 1 per cent level. 


ete.) by applying 7 tests, it is found that the observed differences ате not 
Meaningful for African boys and girls except in Form 3 where the girls sn 
of 38.0 is significantly greater (Difference — 4.7; SEaisjeronce == 1.76; = 
2.67) than that for boys. ‘The same comparisons carried out for Asian boys 
and girls reveal that the boys earn significantly higher mean PM scores in 


Form 2 and for all forms combined. 
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3. Test Performance by Age 


When it was found that the tribe or community designations and form levels 
were largely independent of performance on the PM, the individual scores 
were lumped and analyzed, by sex and ethnic group, over age in years. These 
data are presented in Table 2. It will be seen that, while there is a tendency for 
mean PM scores to decline with age, for the girls this effect is not statistically 
meaningful. Both groups of boys show a significant F ratio when PM scores 
are analyzed over age. A detailed analysis of the differences between pairs of 
means for each of the groups was carried out according to the method suggested 
by McNemar (14, p. 259). For the African boys, the general finding seems to 
be that differences between pairs of means within the age groups 13-17 and 
between 18-21 or more are not significant, but that differences between pairs 
drawn from these two age brackets (1.е., 13-year-olds versus 18-year-olds, etc.) 
are significant. There is evidence of a decline in mean PM score as a function 
of age in years for African boys and this decline is uninterrupted until the 
oldest age bracket (21 or more) is reached. For the Asian boys the significant Е 
ratio seems to be associated with а low mean score for the individuals in the 17 
years or older age bracket. In this context it should be noted that in Table 2 
the last age carrying frequencies for any group contains all individuals older 
than that age. 

TABLE 2 


MEANS AND ANALYSIS OF VARIANCE OF RAVEN STANDARD PROGRESSIVE MATRICES SCORES, 
BY ÅGE IN Years, For Four Groups OF TANZANIAN SECONDARY SCHOOL STUDENTS 


Boys Girls 
African Asian African Asian 

Age M tU КОЮУ SES apo N уйщ 
13 18 40.2 8 82 443 19.9 8 38.0 1.0 57 432 18.1 
14 91 38.2 43 103 444 24.9 50 36.9 6.0 95 434 303 
25 180 36.8 8.5 101 445 24.5 156 33.6 18.7 62 427 197 
16 423 34.9 19.9 87 43.3 214 307 33.8 36.7 77 400 245 
17 530 348 250 40 39.9 9:7 239 342 28.6 23 354 13 
18 562 33.1 26.5 76 33.8 9.1 

19 215 732.5. 10.0 

20 ib EM 3.5 

21 310953 1.5 CE 


Totals 2123 343 100.0 413 45.8 1001 836 341 1001 314 417 993 
Mean square 


years 745.1 208.5 121.0 1743 
Mean square 
error 97.7 32.6 105.5 854 
F 7.635 6.39%" 145 2.05 


** Indicates F to be significant at the 1 per cent level. 
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Another aspect of the secondary school student in "Tanzania is also suggested 
by Table 2 and perhaps bears some relationship to earlier comments about 
age and the problems of age determination. It will be seen that the age of the 
student is related to the ethnic group of which he is member. The mean ages 
for the four groups are as follows: African boys — 17.1, African girls = 16.1, 
Asian boys — 14.8, and Asian girls — 14.3. African students are, on the 
average, older than their Asian counterparts; and boys tend to be older than 
girls within the ethnic group. This same phenomenon would be observed in 
students in other countries and, in the case of the sex discrepancy, would 
probably reflect the tendency for boys to mature more slowly than girls or, 
alternatively, the greater stress placed on the education of boys, both in 
"Tanzania and elsewhere. 


D. Discussion 


As indicated in the preceding section, our analyses reveal 

1. The PM is largely ineffectual in detecting differences between African 
tribal or Asian community subgroups. 

2. Significant differences in mean level of PM performance exist between 
all Asians and all Africans when these groups are compared within form and 
sex groupings. 

3. With two exceptions girls and boys in the same form and of the same 
group (і.е. African or Asian) earn equivalent mean PM scores. 

4, There is a tendency for mean PM scores to vary inversely with age, and 
this is significant in both the combined African and Asian male groups. 

5. Mean ages for Asian groups are significantly lower than for those of 
like-sexed African groups. 

The failure of the PM to differentiate between the tribal or community 
subgroups (with the exception of Form 2 African boys) is taken to mean that 
the differences between the subgroups either do not exist; that, if they exist, 
they are independent of performance on the РМ; or, more likely, that the 
rigorous process of selection for secondary school (about 4 per cent of Tanzanian 
children are able to go to secondary school and this happens only after fairly 
careful screening at the end of the fourth and again at the end of the seventh 
year of primary education) serves to call forth a fairly homogeneous set of 
skills which must be manifested by the students and which are fairly closely 
related to the PM. On one sample of 117 secondary school students calculation 
showed the correlation of the General Entrance Examination (the screening 
test given to select students for secondary school) and the PM to be .242 with 
scores on the former measure badly truncated. In the one instance where there 


212 JOURNAL OF SOCIAL PSYCHOLOGY 


is a tribal effect (Form 2 African boys) this may be partly due to the vagaries 
of sampling. One of the few secondary schools in Tanzania with a long tradition 
of academic excellence supplied 26 out of the 68 Nyamwezi students in the 
sample and, as we have noted above, a high mean for the Nyamwezi was 
associated with many of the differences between pairs of means. The students 
from this school had a mean of 40.7 on the PM. The mean for Nyamwezi 
students attending other schools was found to be 36.8 and the difference 
between these means (SEaiyference = 2.0) is 3.9. This produces a 7 of 1.95 
which is significant at about the 5 per cent level. The other tribe identified 
with significant 2 tests between pairs of means for Form 2 African boys is 
the Nyakyusa, who show a low mean and a fairly large N. Where their mean 
differs from that of other tribes, it is for those subgroups that are also quite 
large numerically, but there seems to be no extraneous (e.g., sampling) anomaly 
that is associated with their relatively low performance. 

The significantly better performance of the Asians than the Africans at all 
educational levels on the PM is probably to be associated with a number 
of cultural factors that differentiate the two groups. Their environments 
differ markedly; the Asians live in the towns, they are likely to start school 
earlier than African students who mainly come from rural environments and 
peasant families, they have daily contact with many of the amenities and the 
stimuli of modern urban life, and they are likely to grow up in a literate 
environment. In another investigation that involved University students, the 
investigators observed this same pattern of discrepancy in mean PM scores and 
tried to ascertain reasons for it. The investigators concluded, on the strength 
of the oral reports of 72 students with whom they talked, that part of the 
difference seemed attributable to the language the student used to manipulate 
the exercises found in the PM. The Asian students tended to resort to one of 
the Asian languages (e.g. Gujerati), while, in generating solutions to the 
designs, the Africans almost invariably employed Kiswahili. As a side comment, 
English is usually the third or fourth language for the secondary or univer- 
sity student in "Tanzania and it tends not to be used in ordinary social com- 
munication. For purposes of completing the PM Kiswahili seems to be les 
powerful when it comes to handling some of the more abstract ideas that 
need to be grasped in order to produce solutions for some of the exercises— 
divergence and convergence, for example, and other concepts. The Indian 
languages are apparently more effective and economical in dealing with these 
complex abstract notions and represent an additional cultural factor which 
is probably involved with the PM scores. 

The inverse relationship (significant for both groups of boys) between 28° 
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and mean PM scores is difficult to rationalize. Characteristically, the PM 
scores level off and remain fairly constant over the period from age 14 to age 
20 or more. The ratio of selection of secondary school students in recent years 
in Tanzania has tended to become somewhat more extreme: that is, a smaller 
proportion of Standard УП students are admitted to secondary school, but the 
proportion has not changed markedly and there is still a wide range of ages in 
all forms of secondary school. The bases for selection have likewise not 
changed. This is demonstrated by the failure of a negative trend to develop 
when PM mean scores are analyzed by form level. What may be reflected in 
the inverse relationship between age and PM scores is the fact that education is 
so highly prized that individuals will persist after it for a number of years, 
repeating primary school by employing any of a number of stratagems— 
changing names and moving to another part of the country, attending private 
schools, and the like. The older students who, after a number of attempts, 
finally manage to enter secondary school may be less intelligent; and the per- 
formance of older groups on the PM may arise from the interaction of pro- 
tracted primary schooling and lower aptitude. Some support for this line of 
argument is found in the fact that the age associated with lower performance on 
the PM is 17 for both the Asian and African boys. A student of 17 or more 
would be above the typical age grade figure for the individual who started 
school at six or seven and proceeded uninterruptedly through primary and 
into secondary school. Those students who are “over age in grade” may be either 
the late starters or the slower learners, and either or perhaps both conditions 
may be reflected in their performance on the PM. 

As suggested above, the differences may also relate to the fact that the 
African students typically start school at a later age than Asian students do. 
The Asian child, living in town with schools readily accessible and with school 
fees also available (since the economic status of Asian families is generally 
high enough to support the charges, and primary schools in Tanzania charge 
tuition) will begin at the age of five or six. The African child, coming from 
a much poorer economic environment with schools more remote and with 
payment of the modest school fees likely to be a serious problem for the family, 
may have to postpone starting elementary school until the age of eight or nine. 
The difference in starting time may have significant implications for perfor- 
mance on tests like the PM because the period of most rapid intellectual growth 
will have been passed for them and the school environment may simply have 
less impact on the skills related to performance on the PM. To be completely 
consistent this argument should apply with equal force to the African girls, 
however, and it does not. The number of girls admitted to secondary school 
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is quite small, though, and the girls able to enter school show some evide 
of coming from the more highly educated and the economically more privile 
segments of the population. 

Our results support Irvine’s contention that test performance is a functi 
of environmental factors if it be accepted that the environmental conditions 
Asian students are relatively more advanced and industrialized than those 
African students, ‘The pattern of scores demonstrated by these groups differ 
markedly from those of the African boys and girls, and these differences am 
likely to be reduced as the national goal to improve environmental conditions 
realized and as school opportunities become equally available to all. Condition 
of education seem to be sufficiently standard at this point so that the differeno 
between ethnic subgroups are mainly independent of school factors except in 
far as the ethnic composition of individual schools may interact with perf 
mance on the PM. 


Progressive Matrices Test is analyzed. Differences in mean lew 
of performance were found to be associated with ethnic group (African 
Asian), age, and sex of the student, but were independent of form level 
tribe or community groupings. 
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CHANGES IN THE JUDGMENTS OF SENSITIZERS AND 
REPRESSERS IN RESPONSE TO FAILURE AND 
SUCCESS EVALUATIONS OF GROUP 
PERFORMANCE* * 


Department of Piychelogy, Emory Usierrrity 


Artnur М. Conex? лхо Ricnaro М. CARRERA 


А. IxTRODUCTION 


same continuum. Repressers cope with anxiety through avoidance of anxiety 
arousing stimuli, while sensitizers approach and attempt to control such stimuli, 
This categorization has been found to relate to a large variety of behaviors 
A comprehensive review of the R-S personality dimension is presented by 
Byrne (3). 


1. Procedure 


‘Twenty-four five-man groups, composed of paid, male undergraduate volun: 
teers from Emory University, were divided into two sets of 12 sensitizer and 
12 represser groups according to scores obtained from “Health and Opinion 
Survey." This scale is a refinement by Byrne (2) of an earlier MMPI mea- 
sure used by Altrocchi, Parsons, and Dickoff (1). A split-half reliability of 96 
and test-retest reliabilities of .88 (one to 18 months), .71 (19 to 36 months) 
and .54 (37 to 85 months) have been reported by Byrne (2). It appears, there- 
fore, that this scale is measuring a reasonably stable response tendency. 

"The 60 sensitizers and 60 repressers were selected from a pool of 200 
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for whom R-S scores had been obtained and represented the ends of the d 
bution. The "Survey" is composed of the total of D plus Pt plus Welsh Anxi 
scores subtracted from the total of the L plus K plus Hy denial scores. It 
127 scorable items. The scoring increases monotonically as the continu 
proceeds from repression to sensitization. The high scores indicate sensiti 
Each set of 12 groups was divided into four homogeneous subsets according 
R-S scores. Scores within each subset of three groups were distributed h 
erogeneously. The groups within each subset were randomly assigned to ¢ 
of three evaluation conditions: failure, success, and neutrality. This procedi 
results in a 2 X 3 design with four five-man groups in each cell. 
Each group spent approximately three hours working on 45 group pro 
whose solutions required interdependent actions through the use of { 1 
messages. The problems ranged from simple tasks requiring the determin: 
of common symbols, from among those that members had, to complex t 
involving the location of missing parts to stories and the identification of th 
group members who held them. The actual time taken by each group was 
corded and ranged from 45 minutes to 35 seconds. Subjects were seatet 
booths around a circular table and communicated only through written n 
sages. Before the trials, on the pretext of not having a watch, the experimen! 
“borrowed” each subject’s watch. Postsession discussions revealed that s 
did not know that watches were being collected systematically and d 
become suspicious about the matter of time. After 45 trials (lasting about 
hours), subjects were asked to fill out questionnaires requiring their ind 
judgments of the time taken by their respective groups on their fastest tri 
the time the fastest trial should have taken, the time taken on their slow 
trial, and the time the slowest trial should have taken. These judgmen 
perceived and ideal times were made again by subjects following an ev: 
of failure, success, or neutrality given by a different experimenter, after 
had been brought to a different room in which they were face to face 
one another. Four represser and four sensitizer groups were randomly à 
to each of the failure, success, and neutrality conditions. In all condition: 
experimenter explained that speed of problem-solving was the most imp 
factor in group effectiveness and of primary interest. In the success cone 
subjects were told that their performance was very fast and that they У 
in the top 20 per cent in comparison with other groups. In the failure cone t 
subjects were told that their performance was very slow and that they 
in the lowest 20 per cent in comparison with other groups. In the n 
condition there was no mention of comparative performance. After the 
nouncements, subjects were instructed to engage in a 15-minute g" 
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cussion of their problem-solving performance. As soon as these remarks were 
concluded, subjects were asked to fill out the same questionnaire that they filled 
out previously. As before, responses to the questionnaire were written indi- 
vidually, but this time in the face-to-face presence of others. 


2. Hypotheses 


The overall hypothesis was that sensitizers would be more influenced than 
repressers by evaluations of failure or success in the directions of these authori- 
tative evaluations. This generally expected greater responsiveness to perfor- 
mance evaluation was broken down into the following specific subhypotheses : 
(a) Following failure evaluation, sensitizers, more than repressers, will in- 
crease their estimates of the actual times taken by their groups to solve prob- 
lems and decrease their estimates of the amount of time which their groups 
should have taken to solve problems. (6) Following success evaluation, sensi- 
tizers, more than repressers, will decrease their estimates of the actual times 
taken by their groups to solve problems and increase their estimates of the 
amount of time which their groups should have taken to solve problems. 


С, RrsuLTS 

Neither of the hypotheses was supported, Sensitizers did not differ signifi- 
cantly from repressers in their pre-post changes under failure or success. 

Analyses of estimates of time taken to solve problems in the before and after 
questionnaires, treated separately, reveal that sensitizers did not differ sig- 
nificantly from repressers in their actual (actual refers to time recorded by the 
experimenters) fastest times, actual slowest times, perceived fastest times, 
perceived slowest times, ideal fastest times, and ideal slowest times. 

Analyses of changes from the first to the second set of estimates made by 
sensitizers and repressers, treated separately, reveal (Table 1) that repressers 


significantly changed their estimates in only one category: that is, under the 
failure condition, repressers decreased their estimates of the fastest times they 
believed their respective groups should have taken to solve problems, This was 
ted on the basis of the failure evaluation. 


change following success evaluation. Under 
failure, sensitizers, in contrast to repressers, and consistent with 

based on the inductions, increased their perceived slowest times and d 

their ideal fastest times. Thus, following an evaluation of failure, sensitizers 


changed their estimates indicating that they believed e 
longer and their ideal times should have been shorter than they had for esti- 
mates before the evaluation. Under success, sensitizers decreased their estimates 
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TABLE 1* 
CHANGES IN TIME JUDGMENTS WITHIN R AND S ом Pre (Q4) AND 


Post (О„) QUESTIONNAIRES 


Fastest Fastest Slowest Slowest 
perceived ideal perceived ideal 

Condition R S R S R 5 R s 
Failure 

XQ, 39.50 63.50 35.25 52.15 181.00 206.50 188.68 153.00 

XQ, 35.50 60.25 28.50 41.25 190.50 241.94 13342 141.20 

SD 14.19 1647 11.93 17.75 43.22 40.30 61.07 6422 

df 19 19 19 19 19 19 19 19 

t 1.262 .883 2.5228 2.746 .982 3.929 375 821 

РА с аш. 05 .01 oS 01 — — 
$иссезз 

XQ, 48.15 43.755 35.80 37.20 293.25 229.50 13410 148.50 

XQ, 4645 38.60 36.60 31.80 289.85 209.80 145.73 138.50 

SD 846 10.02 543 14.18 3123 38.75 3414 — 4401 

df 19 19 19 19 19 19 19 19 

t 899 2.259 661 1.703 486 12272 1525 101 

РА — 05 ЕНИ HUN .05 — — 
Neutrality 

XQ, 42.55 45.25 35.30 37.25 204.75 169.25 143.00 112.50 

XQ, 38.80 45.75 30.55 35.95 201.50 197.25 135.40 147.00 

SD 10.93 394 1095 4.43 45.58 57.34 39.58 50.52 

df 19 19 19 19 19 19 19 93 

1 1.531 284 1.939 1.313 319 2182** .859 3.053 

2 05 — 4 


* All tests are one-tailed, excepting those involving data in the neutrality conditions 
which were two-tailed. 


** Changes by sensitizers in the neutrality condition were significant for both 
Slowest Perceived and Slowest Ideal, but in opposite directions, thus shifting the over- 
all level, but not indicating unidirectional psychological changes. 


of perceived group times for both fastest and slowest times, consistent with 
the evaluation. 


D. Discussion 


The results of this experiment are inconsistent with the results of previous 
research in which sensitizers and repressers were studied as individuals. A 
possible explanation for these unexpected findings might lie in the effect of 
group membership on the emergence of individual response tendencies, evel 
when they are habitual. Making judgments about group performance as 4 
member of a homogeneous group could conceivably alter the quality of an 
individual’s responses to emotion-arousing stimuli through the creation © 
psychological distance: i.e, by decreasing one’s individual sense of respons- 
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bility for the performance of the group and through anxiety reduction as a 
result of simply being in a group situation. The latter interpretation is sug- 
gested by some of Schachter’s (4) findings. 

Perhaps the most significant conclusion that we might draw from this experi- 
ment is that group factors may constitute a set of mitigating conditions that 
intervene between the induction of stimuli and their effects on the manifest 
productions of personality. Group factors were believed to have operated in the 
experiment in two ways: first, via the task of making judgments of group per- 
formance; and second, via the anxiety-reducing presence of similar others in 
a group context when confronted with social evaluation. 


E. SUMMARY 


This study tested the general hypotheses that small homogeneous groups of 
sensitizers and repressers would differentially judge the performance of their 
groups following exposure to failure and success evaluation. This expectation 
was not confirmed, although previous research with individual repressers and 
sensitizers has consistently shown differences in response to anxiety-arousing 
stimuli. It was suggested that group factors may constitute a set of mitigating 
conditions that intervene between the induction of stimuli and their effects on 


the manifest productions of personality. 
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HOW “OPEN” ARE MULTIPLE-DWELLING UNITS?*! 
Department of Psychology, State University of New York at Bufalo? 


Јонх M. McGrew 


A. INTRODUCTION 


Although New York State law prohibits discrimination in multiple-dwelling 
units—i.e., dwellings inhabited by four or more residents—and the State 
Commission on Human Rights had received no complaints regarding sus- 
pected discrimination, it was popular opinion that discriminatory practices still 
existed. Therefore, an empirical investigation of such practices seemed useful 
to develop data having both general interest and practical value, In regard to 
the latter, information concerning the “openness” or availability of various 
apartments could be used by potential Negro renters; discriminating landlords 
could be effectively handled by the proper authorities; and, finally, such action 
could open housing still further. 

On a more general plane, it was thought that the data would provide evi- 
dence for one of two points of view: the notion originating with William G. 
Sumner (7) that “stateways cannot change folkways,” or the conclusion of 
Deutsch and Collins (2) that “official policy, executed without equivocation, 
can result in large changes of behavior and attitudes despite initial resistance 
to that policy.” 

The study was also seen to bear upon the contradiction between verbal atti- 
tude and behavior, as originally suggested by LaPiere (4). 

LaPiere’s data showed that while hotel proprietors expressed negative 
attitudes toward the Chinese people, they exhibited positive behavior towards 
a Chinese couple. In the present investigation, it was hypothesized that the 
reverse situation would occur: i.e., apartment landlords would express positive 
attitudes towards Negroes, by a willingness to rent to them, but would, in 
practice, exhibit negative attitudes, by discriminating against a Negro couple. 

A final question posed was derivable from the theory of “No Social Equality” 
(5). Myrdal noted that although interracial marriages are viewed negatively 
by white Americans, there was much more ostracization in the case of a 
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Negro male married to a white female, than in the case of a white make 
married to a Negro female. Hence, it was hypothesized that a Negro male 
white female couple would be less able to obtain an apartment than a white 
male-Negro female couple. 


В. METHOD 


Ten multiple-unit apartment dwellings, located in the central and suburban 
areas of Buffalo and which advertised vacancies, were selected. The student 
experimenters (Es) were divided into white (W) and Negro (N) couples, 
who visited a given apartment in a white-Negro-white or ABA order, approx- 
imately an hour apart. 

In the case of the interracial couples, five of the 10 apartments were 
studied. The white male-Negro female (WN) couple visited a given apart 
ment and was followed, approximately an hour later, by the Negro male-white 
female (NW) couple. Both couples visited the five apartments prior to the 
W and N couples. 

Prior to the day of the experiment, all of the student couples participated 
in a workshop. During the course of these sessions, the importance of mini- 
mizing between-couple manner differences was stressed. For this purpose, 
each couple was instructed to dress neatly, preferably shirts and ties for the 
men and skirts and blouses for the women ; couples were instructed to mention 
that they had no children; the girls were requested to purchase wedding bands; 
and, finally, all of the couples were asked to inform the landlord, after the 
sought information was obtained, that the apartment was unsuitable. 

Besides requesting that the apartment be shown to them, each couple detet- 
mined the rental and the willingness of the landlord to rent. Several weeks 
later, each landlord was telephoned and asked if he would rent to a Negro 
couple (verbal attitude) and if he were acquainted with the State law barring 
discrimination in multiple-dwellings. 


C. RESULTS 


The data showed that, at the time sampled, some discrimination was taking 
place. That is, in four cases, the landlord refused to show the vacant apart 
ment to the Negro couple and, in one case, increased the rental. Furthermore 
in six cases, the landlord refused to rent the apartment to a Negro couple 
offered a variety of excuses, such as “there is a long waiting list,” “a 30-day _ 
notice" or “60-day notice" is needed, or “the apartment is already rented.” 

In order to determine the statistical significance of these findings, Fisher's 
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Exact Probability Test (6) was computed on the information obtained by the 
couples. In each instance, the null hypothesis was that there were no differ- 
ences in the responses received by the W and N couples. Although there were 
no significant treatment differences in regard to the landlord's willingness: to 
show the apartment or the amount of rental requested, there was a racial dif- 
ference obtained in the case of the landlord's willingness to rent the apartment 
(p = .02). Since this difference was in the expected direction—that in W 
couples more readily obtained the apartments than N couples—the one-tailed 
significance level is .01. 

The telephone inquiry revealed that for the six landlords who refused to 
rent to a Negro couple, their verbal attitude was not consistent with actions. 
Furthermore, four of the six landlords claimed unfamiliarity with the State 
law. In the first instance, the difference was significant at the .05 level and, 
hem .05. Both of these comparisons were tested by Fisher's 

rmula. 

In the еме of the interracial couples, no differences were нй i i 

When the data obtained from the interracial couples are compared to those 
obtained from the Negro couples, however, it appears that three apartments 
attainable by the interracial couples were unattainable by the Negro couples. 
Fisher's Test was computed for these differences and found to be not significant. 


D. DiscussioN AND CONCLUSIONS 


The data seem to indicate that, at the time sampled, statute had not been 
effective in producing simple compliance, thus supporting the notion that 
"stateways cannot change folkways.” Although the data further show that 
66 per cent of the discriminating landlords claimed no knowledge of the law, 
it is not discernible whether this is just a “claim” or suggestive of inadequate 
channels of communication extant between the proper authorities and those 
concerns which manage public housing. Future research could possibly elimi- 
nate this likely error of interpretation by indicating whether or not the proper 
authorities have, in fact, informed public housing managers of their obligation 
in upholding nondiscriminatory practices where laws, to that effect, exist. © 

When verbal attitudes concerning discriminatory practices were compared 
with behavior, the data were consistent with the now classic suggestion of 
LaPiere. His original data, however, indicated negative verbal attitude, yet 
positive action towards a Chinese couple; the present data reverse that trend. 
Namely, the results indicated positive verbal attitude but negative behavior 
in the situation. On the one hand, these findings seem consistent with the 
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prevailing attitudes surrounding interracial housing as recently indicated in 
a nation-wide survey (3). On the other, the results seem to provide one of 
the first instances of what Campbell (1) has called “inconsistency.” 

Campbell criticizes the LaPiere study because it confused correlational in- 
consistency with situational threshold differences. The net result, Campbell 
argues, is an exaggeration of the inconsistency present. According to Campbell, 
“inconsistency would be represented if those who refused face to face accepted 
by questionnaire, or if those who accepted by questionnaire refused face to 
face. There is no report that such cases occurred” (1, p. 160). 

The present data seem to provide evidence for both types of inconsistency 
and suggest the relative unimportance of verbal attitude content, concerning 
the racial issue, when behavior is to be predicted. 

Finally, there seemed to be no treatment differences between the interracial 
couples and, as a consequence, some question is raised about Myrdal’s (5) 
formulation. The small number of apartments sampled, however, does not 
allow for broad generalizations. Such a trend does merit further research and 
especial concern should be given to the differences between interracial couples 
and Negro couples. 
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SOCIAL INFLUENCES ON JUDGMENTS INVOLVING 
THE LOWER SENSES: ODORS*! 


Department of Psychology, State University of New York at Albany 


ABRAHAM S. LUCHINS 


A. INTRODUCTION 


Many studies on social influences on perception and judgment utilize visual 
stimuli as evidence and yet often draw generalizations concerning the entire 
realm of perception and judgment. Such generalizations may be valid for 
much of man's perceptual and judgmental behavior because of the dominant 
role that vision plays in his life. But men also make judgments about pain, 
pressure, warmth, cold, smell, taste, weight, etc., which involve the chemical 
senses, cutaneous senses, and the kinesthetic sense—the so-called lower senses. 
It seems important to investigate whether judgments involving the lower 
senses would also be influenced by the same social factors that have been 
found effective in influencing judgment of visual evidence. The author has 
elsewhere (6) conjectured that such social factors would be less effective in 
judgment situations involving the lower senses than in judgment situations 
involving visual evidence; that is, Ss would react to the evidence in terms of its 
physiological effect on their sensorium instead of reacting to what others say 
about it. 

The present study was conducted to test this conjecture, using chemical 
solutions as evidence (objects of judgment). In each pair of solutions given 
for judgment, one solution had a more unpleasant odor than the other. The 
assigned task was to smell the solutions and to choose the more pleasant of 
the pair. Thus, the “correct” choice was the solution with the more pleasant 
odor. Social factors in the form of overheard responses and scoring of responses 
were introduced to foster "incorrect" choices. Such factors were used in 
previous studies (4, 5) of social influences on judgment of the length of line 
segments. In line with the above conjecture, the author hypothesized that the 
social influences would be less effective with the odors, because of their physio- 


logical impact, than with the line segments. 
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B. Метнор 


1. Confirmation by Experimenter of Confederate’s Incorrect Choices 


Instead of the five pairs of line segments that were used in previous studies. 
to minimize and maximize social influences on judgment (4, 5), five p їп 
of test tubes containing solutions of ammonium hydroxide served as objects of 
judgment. Although the solutions in all the test tubes appeared to be similar, 
they differed in molar strength of ammonium hydroxide. The molar strengths 
were zero and 1⁄4, zero and 1, 4 and 2, 1 and 3, and 1 and +, in pairs one 
through five, respectively. In each pair, one test tube had pain ed on it the 
letter A, and the other the letter B. The announced task was to sniff cach 
test tube and to report out loud whether A or B was the more pleasant-smelling: 
solution. 

The experiment was administered, in the same manner as in the studies usi 
line segments, to two individuals at a time, one of them a naive $ and the 
other E's preinstructed confederate. The Ss were 10 college sophomores, five 
males and five females, The confederate, a college student of the same sex 88 
$, always judged first. In the present experiment he always chose incorrec 
that is, he selected the solution that was actually the stronger of the pai, 
but said that it was the more pleasant-smelling solution. He had, prior to te 
experiment, practiced pretending to inhale deeply when sniffing the test tubes 
so that he would not react to the noxious odors, After he sniffed and repo te 
his selection out loud, he passed the pair of test tubes to S, who was tok 
by E to make his choice. In this way, all five pairs of solutions were judged. 
After $ had made his selection of the fifth pair, E gave S and the confede 
their scores on the test. The confederate was told that he had 100 per ce 
S was given as his score the percentage of times that he had selected the s? 
solution as the confederate (he was not told on what basis the score V 
calculated), E then told them that they would be given another chance. 
this manner the series was readministered five times. 

Ss very infrequently chose the stronger solution of ammonium hydroxi e 
as the more pleasant-smelling one, even though the confederate selected it а 
even though E confirmed his selection. Ss tended to be guided by the objective. 
evidence and not by the social evaluation of their judgments. On the fifth pre 
sentation of the solutions, there were only 20, 10, zero, zero, and zero per 
incorrect choices (an average of 6 per cent). The results are strikingly di 
ent from those obtained with the same procedure when pairs of line segments 
were the objects of judgment. On the fifth presentation of the line segmen ts, 
10 college students gave 80, 60, 60, 30, and 30 per cent incorrect choices 
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for pairs one through five, respectively (an average of 52 per cent). On the 
earlier presentations, also, there was strikingly more agreement with the con- 
federate for the line segments than for the odiferous solutions. 

The behavior of the Ss while they were judging the odors indicated some 
reasons for the lack of social influences on their judgments. When they sniffed 
the solutions, they winced, turned their heads away, looked startled, or were 
even annoyed and angry. They made facial contortions suggestive of a painful 
or disagreeable experience and exclaimed words to the effect: “It is terrible," 
“It hurts,” and “It stings.” One $ who agreed on the first presentation of 
the first pair cursed the confederate for misleading him. 

For some Ss, it seemed to be a conflict as well as a stressful situation. On the 
one hand, they would get credit in their psychology class for being Ss and, on 
the other hand, the odor was so unpleasant that they were disinclined to smell 
the solutions. Some tried to resolve the conflict by not sniffing or inhaling. 
One $ said that without sniffing he knew which was the more unpleasant solu- 
tion: namely, the one which the confederate had chosen. 

Ss inquired about or tried to identify the nature of the solution. They also 
wanted to know what determined the correct answer. They challenged the 
confederate's choices or the validity of the scoring key of the test. Such 
challenges were relatively infrequent when line segments were the objects of 
judgment. Moreover, in the line judgment studies, most Ss did not mind 
having the experiment readministered. In the present study they resisted 
repeat administration and E had to urge them to continue. 

А variation of the experiment was conducted using only three pairs of 
solutions for evidence.” One member of each pair was а faint solution of 
cologne; the other one was 3.7 molar strength of ammonium hydroxide, 7.4 
molar strength of ammonium hydroxide, or concentrated valeric acid in pairs 
one, two, and three respectively. Not one of the 15 college students who served 
as Ss agreed with the confederate’s “wrong” selection, after the first presenta- 
tion. Of special interest are the verbal utterances made during the experiment. 
Of 52 spontaneous comments, the most common (21 per cent) concerned the 
nature of the stimuli: Ss asked what the chemicals were; some Ss identified 
the ammonium hydroxide (ammonia), but none identified the valeric acid. 
The next most frequent vocalizations (13 per cent) were comments in which 
Ss justified or affirmed their choices by referring to the unpleasantness ог 
“disgusting” nature of the odors or asked whether they had to choose 2s the 
confederate did, whether the most offensive odor was really wanted, and who 


2 The author is grateful to Michael Spiegler for conducting the experiment with 
the series of three pairs of chemical solutions. 
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determined what was unpleasant. Here, too, the key used to score their answers 
was challenged. One $ said that if the experiment went on all night he would 
continue to choose as he did. 

The present results are similar to those obtained in studies that used varying 
strengths of hydrogen sulfide or valeric acid, each of which was compared 
with faintly scented rose water. The results are also similar to another study 
in which Ss judged samples of margarine and rancid butter that had been 
mixed to obtain varying degrees of "rancidness." In these studies, also, 8 
seldom agreed with the confederate’s incorrect" choices. Further, there was 
great reluctance to continue with the experiment. Most Ss said that they were 
nauseated, “disgusted,” or “sickened” by the smell. It was suggested that they 
walk to an open window for fresh air after each trial. Nonetheless, two of 
the 10 Ss in the valeric acid study and one of the 10 Ss in the hydrogen sulphide 
study refused to go on after the second or third presentation. 

Questioning Ss after each experiment revealed that many did not agree 
with the confederate because of such assumptions and beliefs as the following: 
They experienced a disagreeable sensation when they chose the solutions that 
the confederate called pleasant. Some, therefore, lost whatever faith they had 
in his judgment or they were annoyed at his misleading them. It was obvious 
to them which solution was most unpleasant and it seemed that the confederate 
was purposely selecting the wrong solution. Some were fortified in their 
choices by the belief that people differ in their judgment of odors: what smells 
like a rose to one person may be a foul odor to another person; some people 
find a certain perfume pleasant, others find it mildly offensive; each person is 
entitled to his taste. Ss in the line judgment experiment did not have such 
social supports; indeed, they may have been influenced by the belief that vision 
often involves illusions and that one may be wrong in visual perception. 


2. Infirmation by Experimenter of Confederate’s Correct Choices 


As in the first experiment of the previous section, the confederate and the 
Ss were asked to judge five pairs of solutions of ammonium hydroxide, but 
now the confederate always chose “correctly”: i.e, he selected the weaker 
solution and said that it was the more pleasant. After the confederate and $ 
judged the fifth pair, E said that the confederate had zero per cent and S was 
given as his score the percentage of times that he had disagreed with the 
confederate. 

Ss were not influenced by E calling them wrong. On the fifth presentation 
10 college sophomores gave 10, zero, zero, zero, and zero per cent incorrect 
answers in pairs one through five respectively (an average of 2 per cent). This 
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is in marked contrast with the results obtained when the same procedure was 
used with the line segments. On the fifth presentation of the five pairs of 
line segments, 15 college sophomores gave 60, 60, 40, 33, and 33 per cent 
incorrect choices, respectively (an average of 45 per cent). On the early 
presentations, also, there was markedly less disagreement with the confederate's 
"correct" choices for the ammonium hydroxide solutions than for the line 
segments. 

The procedure of the above-described experiment was also used with the 
three pairs of solutions in which one member of each pair was cologne and 
the other member was 3.7 molar strength of ammonium hydroxide, 7.4 molar 
strength of ammonium hydroxide, and concentrated valeric acid, respectively. 
Not one of 10 college Ss chose incorrectly when E infirmed the confederate's 
correct choices. Similar results were also obtained with the series of pairs of 
hydrogen sulfide, or valeric acid, and of rancid butter. 


C. Discussion ' 


The results support the hypothesis in that social factors, effective in pro- 
ducing incorrect choices when line segments were used, were ineffective when 
odors were the objects of judgment. Whereas in the line segment experiments 
Ss sought to understand the basis of the confederate’s responses and tried 
“also to be right” or “to get 100 per cent on the test,” in the odor-judging 
experiments they seemed to focus on the evidence. The impact of the odor on 
their sensorium may have helped to keep the evidence in the foreground and the 
passing of the test in the background. Ss may now have been concerned more 
with the object of judgment than they were in the line segment experiments 
because they could more readily be indifferent to the length of a line segment 
than to a noxious odor. Perhaps the results indicate that discrepancies among 
external evidence are more readily reconciled by § than discrepancies involving 
contact stimuli. 

That it is not merely a difference between visual and olfactory sensations 
that produced the difference in results was demonstrated in preliminary 
studies in which Ss were asked to select which of two flashing lights or of 
two bright surfaces had less glare. The confederate chose the more glaring 
light or surface of the pair of stimuli. But, regardless of what the confederate 
or E said, rarely did Ss who had to squint or who were startled or dazzled 
by the light say that it was less glaring or more pleasant than the softer and 
more diffused light. It does not seem, therefore, to be merely a matter of vision, 
audition, chemical, or cutaneous sensation but the kind of impact the stimulus 
or evidence has on S. Perhaps the results of many studies of social influences 
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are due to the fact that the transactional relation of S and the evidence is 
of such a nature that he reacts to the evidence as something at a distance, a 
something outside and unrelated to him; the social factors are actually the 
stimuli that are close to $ and affect him. Perhaps different social factors 
have to be used to produce conformity with wrong judgment when the evi 
dence is peripheral in the judgment situation than when it is central or close 
to 8. 

In the line segment experiments, $ was not only judging something at а 
distance but was using social concepts in his judging. He was not reacting 
naively to what he sensed in his phenomenal field, but to a socially meaningful 
object. Experiments on social influences that use such visual evidence are 
using parts of S's perceptual world that have become differentiated and 
patterned in terms of social norms. "These experiments may, therefore, demon- 
strate that social influences are more effective in situations that involve evi- 
dence that has been shaped by social influences. Experiments on social influences 
have dealt mainly with culturally meaningful evidence and with a modality of 
perception that has been culturally differentiated and accentuated. Little of 
no research has been done with the socially undifferentiated aspects of a person's 
behavioral world or with his more primitive reactions to stimulation. Study of 
such primitive reactions, particularly study of how they get transformed by 
experience, might lead to a better understanding of the nature of social 
influences. 

'The present study dealt with one extreme of the hedonic scale: the um* 
pleasant. What would be the effects of social factors if the odors in each 
pair were both pleasant, but of varying degrees? There are studies in the 
literature on suggestibility that indicate that judgment of pleasant odors may 
be affected by social factors (2, 7). 

In the present study, the difference between the solutions in each pair was 
readily discriminated. What would happen if the discrimination was made 
more difficult: that is, if the solutions in test tubes A and B were closer in 
hedonic value, so that the choice between them was less clearcut? Some of 
research literature on social influences (1, 3, 5, 7) suggests that these influences 
tend to be greater in ambiguous situations than in clearcut situations. i 
may lead one to expect that social influences on the judging of odors may be 
greater than in the present study when the discrimination is more difficult. 

In the odor-judging study, $ was caught off guard by the unexpected un- 
pleasant odor and, under the circumstances, he did not learn to give a socially 
acceptable response, or perhaps he believed that it was socially acceptable to 
express discomfort and pain when one experienced it. He might be comp? 
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to a boy who smokes for the first time and vomits, or tastes strong cheese 
and spits it out. The boy soon learns to give socially acceptable reactions to 
the cigarettes and cheese. Similarly, individuals can be trained to give the 
socially acceptable response to odors and not to react to their physiological 
impact. Workers in perfume and fertilizer factories, students in anatomy lab- 
oratories, and soldiers in the battlefield become accustomed to certain odors 
Itis of interest to conduct studies in which an S is accustomed to certain odors, 
by gradually adapting him to them or by embedding the odors in situations 
that will give them different meanings, or by giving him a goal that will 
encourage him to endure the unpleasant odor, In such experiments we may 
be able to study how social factors become dominant in one's cognitive grasp 
of the world to the extent that certain external objects are not reacted to in 
terms of their physiological or primitive impact on the sensorium, 

In conclusion, two approaches to the study of social influences on perception 
suggest themselves: (а) to start with evidence to which one reacts mainly in 


in terms of the evidence. І 
D. Summary 


Four experiments were conducted to test the hypothesis that social factors 
would be less effective in determining the judgment if the objects of judgment 
were unpleasant odors instead of the visual stimuli often used in experiments 
on conformity and social influences on perception and judgment. In line with 
our hypothesis, social factors which produced marked influences on the judg- 
ment of line segments, proved ineffective when Ss were given the task to 
select which of two unpleasant odors was the more pleasant. Ss selected the 
objectively more pleasant odor and not the one which was socially evaluated 
as more pleasant. Ss judged on the basis of the impact of the odor on their 
sensorium and not in terms of the social evaluation of the odor. 
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А. INTRODUCTION 


In studies of social influences on judgments that involve the lower senses 
(6), we found that subjects did not agree (conform) with the overheard 
wrong judgments made by others, but they reacted to the nature of the 
impact of the evidence on them. In these studies, the subject's transactions with 
the object of judgment resulted in an unpleasant experience: e.g. pain or 
disgust. The results may, therefore, have been due to the noxious nature of the 
stimulus and not due to the fact that the evidence related to the lower senses. 
If we minimized the unpleasantness, would subjects still abide by the evi- 
dence when making a judgment involving the lower senses? 


B. GENERAL PROCEDURE 


The procedure was similar to that used in previous experiments on the 
judgment of line segments (4). Instead of a series of five pairs of line seg- 
ments, а series of five pairs of weights were used. The weights were made 
from half-pint milk containers, which were filled with buckshot. Each con- 
tainer had a different alphabet letter printed on it. The weights of the five 
pairs, in the order presented, were м and 1; Y and 2; 1 and 4; 2 and 8; and 
4 and 16 ounces. The weights were presented to two individuals: one of 
them was a naive S and the other a preinstructed confederate. After describing 
the task, E said that the confederate would respond first to each task because 
he had come into the room before S. Then E placed one container in each 
palm of the confederate's outstretched hands and asked him to state, out 
loud, which was heavier. "The confederate stated his choice in terms of the 
letter on the container. Then E put the same pair of weights in the palms of 
the S's outstretched hands and asked him to state which weight was heavier. 
All five pairs were presented in this manner. After the fifth pair was judged 
by 8, E announced, so that both could hear, the scores each had obtained on 
the test. (The method that was used to score the responses will be described 
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in each experiment reported below.) In this way the series of pairs of weights 
was presented five times. Then E left the room and 8 was questioned by the 
confederate. When E returned, the confederate left the room and the weights 
were again presented to S. E compared these new judgments with S’s previous 
ones, and questioned S to elicit the reasons for his choice of weights and his 
reactions to the confederate’s choices. 


1. Confederate’s Incorrect Choices Confirmed 


The experiment was administered to 15 college students. The confederate, 
who was also a college student, always chose incorrectly: i.e., although the 
assigned task was to select the heavier weight, he chose the lighter weight 
and stated that it was heavier. After all five pairs of weights were judged by 
the confederate and S, E said that the confederate had 100 per cent on the 
test. The score assigned to § was the percentage of times that he had given 
the same answer as the confederate. 

The results of this study indicate that, although the object of judgment did 
not give rise to a feeling of unpleasantness, Ss tended to judge in terms of the 
evidence and not in terms of the overheard wrong judgment that Ё had con- 
firmed. Very few Ss agreed with the confederate’s incorrect choices. Only 
27, 13, zero, zero, and zero per cent incorrect choices were made by Ss in 
pairs one through five respectively, on the fifth trial. In contrast, when pairs 
of line segments were judged, a comparable group of college students gave 
87, 60, 33, 33, and 33 per cent incorrect choices in pairs one through five 
respectively, on the fifth trial. Prior to the fifth trial, there also was less 
agreement with the confederate's incorrect choices when weights instead of 
line segments were the objects of judgment. 

There was now a greater variety of behavior than in the line judgment 
experiment. Ss switched the weights from hand to hand, turned the containers 
around, hefted them or put them on the table and then picked them up. Some 
turned the containers around to examine them more closely. Some Ss looked 
at E and the confederate with puzzlement or suspicion. When they finally 
gave their answers some smiled, frowned, looked perplexed, sighed, shook 
their heads, or shrugged their shoulders. Many Ss insisted that the weight 
they had chosen was the correct one, making remarks such as the following: 
“This is heavier and not the one that he said,” “I know I'll be called wrong: 
but I feel that this is heavier.” Some Ss inquired whether the lighter oF 
heavier weight was wanted. Some said that the purpose of the experiment 12 
not to judge weights, but to fool them, to confuse them, to make them question 
their own judgments, or to make them follow the confederate. Some said that 
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there was a "conspiracy" between E and the confederate. More frequently than 
in the line judgment experiment, Ss challenged the scoring key or asked for 
an objective standard: e.g., a scale. Generally speaking, in the line judgment 
experiment Ss tended to entertain doubts about their perception, but in the 
weight judgment they did not. 


2. Confederate’s Correct Answers Infirmed 


In this experiment the confederate, as per instructions, chose correctly 
but E infirmed his correct responses. The confederate was given a score of 
zero per cent and $'s score was the percentage of times that he had disagreed 
with the confederate’s correct answer. Again 85 tended to select what was 
objectively the heavier weight. Fifteen college students from the same pool 
of Ss gave only 7, zero, zero, zero, and zero per cent incorrect answers on the 
fifth presentation of the five pairs of weights. In contrast, a comparable group 
of college students who were given line segments to judge gave 66, 60, 40, 
33, and 33 per cent incorrect answers on the fifth presentation. Thus, we again 
see the relative ineffectiveness of social pressures to produce incorrect answers 
when weights were the evidence to be judged. Some of the comments were 
similar to those in the previous experiment. 


С. REPLICATION 


When the first experiment described above was replicated with 20 junior 
high school students, with graduate students serving as confederates, 30, 5, 
15, 5, and 5 per cent incorrect answers were obtained. When the second 
experiment described above was replicated with a similar group of junior 
high school students, 20, 5, 5, zero, and zero per cent incorrect judgments 
were obtained. Thus, we see that junior high school Ss also tended to focus 
on the weights and not on the social pressures. Those who agreed were more 
often girls than boys. Also, girls tended to agree somewhat more often with 
a male confederate than with a female confederate; boys showed no difference 
in respect of confederate’s sex. 


D. Discussion 


The results support the conjecture that an individual will tend to react 
to the evidence when it has a direct physical impact on him, in spite of social 
Pressure to make him agree or to give a wrong answer. The implication is 
that different sense modalities may be affected differently by the same social 
pressures, 

An alternative explanation is suggested by the qualitative data, In the line 
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judgment studies Ss frequently said that they thought that they were ex 
periencing an optical illusion, but no $ made such statement in the weight 
lifting experiment. The assumption that the line segments exemplified an 
optical illusion may have been due to the fact that popular as well as academic 
psychological literature contains more examples of optical illusions than of 
other kinds. Some of this literature supports the notion that it is normal and 
rather common to experience visual illusions, Moreover, the authors found in 
their research on the occurrence of illusions in daily life that their Ss tended 
to consider illusions involving the lower senses as rather “abnormal.” 

In the weight judgment experiments, Ss seemed more concerned with 
the weights than with the test scores. But in the line judgment experiments 
they seemed to be very much concerned with the test scores and less concerned 
with the line segments. This difference may be due to the fact that in the 
judgment of the lines $ passively viewed a stimulus array which was distal to 
him, but, in judgment of the weights, the evidence was proximal to him and 
he manipulated it, which resulted in a sensation or "feeling" in him. 8 did 
something to the weights, which, in turn, did something to him. "Thus, in the 
judgment of line segments the stimuli were static, but in the weight judgment 
experiments the stimuli were a result of a dynamic event—a transaction 
between the evidence and $. In the line judgment studies, too, there was à 
transaction between aspects of the stimulus manifold confronting the subject; 
however, it was not with the objective evidence, the line segments, but with the 
social influences. "Therefore, the lines, the objects of judgment, receded into 
or remained in the periphery or background of S's cognitive grasp of the 
judgment situation. 

It may also be said that in the weight judgment experiment Ss paid more 
attention to the organismic than to the social factors in the judgment situation. 
Piaget’s concept of egocentrism (1) suggests that Ss gave an egocentric rather 
than a socialized response. Goldstein’s concepts (2) suggest that Ss were 
stimulus bound, that they were concrete-minded and did not assume an abstract 
attitude. In terms of Janet’s concepts of suggestibility (3), it may be said that 
Ss reacted directly to the impact of the stimuli and did not control the effects 
of the stimuli by the mediating processes of reasoning and judgment. In terms 
of a concept from Werner (7), it may be said that Ss reacted to the physioE- 
nomic features of the evidence. Such explanations may suggest that Ss’ reacting 
to the evidence gua evidence is a less mature response, developmentally speak- 
ing, than reacting to the social forces. To interpret the findings in this mannet 
is to subsume the concrete instances of the Ss’ behavior under concepts that 
are related to a theory and to interpret the behavior in terms of a theory instead 
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of viewing the behavior in terms of its function in the situation in which it 
took place. Ss’ reacting to the impact of the evidence led to veridical judgment 
in the present experiments, regardless of whether or not their behavior can be 
interpreted as signs of functioning a lower developmental level. The 
present results point to the need for mulation coretical interpretations 
of perception, in terms of maturity, to include the dimension of fitness or 
adequacy of the response to the objective evidence (5, pp. 168-198). 
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A. INTRODUCTION 


The semantic differential has been used in a variety of situations to gather 
quantitative information on meaning and value (7). Applied to the study of 
aesthetics, it should yield multidimensional results, an advantage over the 
complicated but unidimensional metric developed by S. S. Stevens (8). 


B. METHOD 
1. The Semantic Differential 


Basically, the differential is a list of bipolar scales, such as Good-Bad, 
Valuable-Worthless, with (here) seven possible degrees of judgment, with 
the middlemost degree signifying neutrality or indecision. The initial differ- 
ential was constructed from guesses that were made about what might be 
meaningful descriptions of folkmusic, and revised on the basis of intuitive 
analysis of the data collected from the first several groups. The differential 
eventually settled on was composed of 24 scales. 


2. Subjects 


Subjects were drawn from various academic groups. The groups for the 
first preliminary surveys were composed of junior faculty and their wives. 
Factor analysis was performed only on sets of data that were taken from two 
groups of college students. Group A consisted of 12 volunteers from several 
advanced psychology courses, and Group B contained an equal number of 
undergraduates from an introductory psychology class. 


3. Test Situation 


The folksongs were selected from a collection of several thousand card 
indexed songs on microgroove records. Criteria for selection were, among 
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others, representation of the field from interpreters to folkniks, different moods 
and emotions, and different classes of songs from Child ballads to recently 
composed material. 

Testing sessions were begun by having the subjects fill out a questionnaire 
intended to ascertain their knowledge of folkmusic and peripheral forms of 
recorded songs. Records were played on a music system that utilized a Thorens 
TD 124 turntable, a Shure M232 arm containing an Empire 880P cartridge, 
and an Eico ST70 stereophonic amplifier feeding two Electrostat-2 acoustic- 
suspension speaker systems placed 10 feet apart. All records were played 
monophonic mode, At the beginning of a session, the subjects were advised | 
that they would not be told anything about the song until the end of the 
session, and they were asked not to communicate with each other. After the 
song had been played, the subjects rated their opinion of it on the semantic 
differentials provided them. 


C. Кеѕ01тѕ 


The data used in the preliminary factor analysis were based on eight songs 
each scored by the 12 subjects of Group B on the 24 scales selected. Factor 
spaces for two songs scored by Group A were plotted later. The data were 
subjected to centroid factor analysis. "Three strong factors were extracted. 
"These were then rotated to simple structure by varimax rotation (3). | 

Factor I was strongly evaluative. The following scales had the high loadings 
(greater than .70) : Worth Hearing Again-Not Worth Hearing Again, Good- 
Bad, Interesting-Boring, Performance Good-Performance Bad, Pleasurable- _ 
Not Pleasurable, Meaningful-Meaningless, Material Good-Material Bad, and 
Valuable-Worthless. This factor is plotted in Figure 1, with an arbitrary 
designation of Good and Bad, although all the other scales loading on this 
factor influenced the factor position. 

Factor II was similar to the potency factor which Osgood е? al. have found — 
in their studies of meaning (7). The highest loadings on this factor were on _ 
the following scales: Rugged-Delicate, Rough-Smooth, Hardboiled-Senti- 
mental, Tense-Relaxed, Unsociable-Sociable, and Masculine-Feminine. ‘These 
scales all had loadings of .50 or better on this factor, which is plotted on 
Smooth-Rough axis. Again, the name chosen for the dimension is arbitrary 
the scales loading highest on this factor contribute the most to the factor 
position of a song. 

Factor III was much weaker than the other factors. Its highest loading 
was .58 on the Shallow-Deep scale. Other classifications loading above 45 on 
this factor were Insensitive-Sensitive, Stale-Fresh, Dishonest-Honest, 

x 
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A MODEL or THE VALUE SPACE FOR SOME OF THE FOLKSONGS USED IN THE SURVEY 


The base of projection from each song gives the ratings on the Good-Bad and 
Smooth-Rough axes. The length of the projection gives the rating on the Deep-Shallow 
axis, a solid projection indicating a rating toward the Deep end and a broken projec- 
tion indicating a rating toward the Shallow end. A and B refer to the different 
college student groups (see text). The songs are 5: “Dark Was the Night” by Blind 
Willie Johnson (Folkways 3585), 6: “This Land Is Your Land” by Woody Guthrie 
(Folkways 2481), 7: “This Land Is Your Land” by Peter, Paul, and Mary (Warner 
Bros. 1473), 8: “This Land Is Your Land” by Cisco Houston (Vanguard 9057), 14: 
“Take My Hand” by Cisco Houston (Vanguard 9107), 42: “What You Gonna Call 
Your Pretty Little Baby” by Joan Baez (Squire 33001), 45: “John Hardy” by Lead- 
belly (Stinson 19), and 53: “Captain Kidd” by Phil Campos (Horizon 1616). 


Simple-Complex. This factor is plotted on an axis arbitrarily called Shallow 
to Deep. 

Figure 1 gives a graphic representation of the relative value of each song 
on each dimension. All the songs placed in one of two octants, with all 
factors nonnegative or all factors nonpositive. 

Group A was compared with Group B on ratings of two songs. Cisco 
Houston’s version of “Take My Hand” (a religious song of deep feeling, 
which Houston combined with the satirical “Didn’t It Rain, Little Children”) 
was rated higher on Factor I, lower on Factor II, and higher on Factor III 
by Group B than by Group A, but the scores were close and both groups 
judged the performance to be Good, Smooth, and Deep. 
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Huddie Ledbetter's version of “John Hardy,” sung in the Mixolydian mode 
with concertina accompaniment, was rated higher on all three factors by 
Group A than by Group B, but all the ratings were somewhat unfavorable. 
As indicated in the figure, both groups considered the performance slightly 
bad, rather rough, and just slightly shallow. 

A comparison was made of the scores of different performers singing the 
same song, “This Land Is Your Land,” composed by Woody Guthrie. Ratings 
were done by Group B, as shown in Figure 1. On the evaluative factor, the 
Woody Guthrie performance scored 2 on the Bad side; the Houston perfor- 
mance scored 19 on the Good side; and the Peter, Paul, and Mary rendition 
was 21 on the Good side, only 2 points above that of Houston. On the potency 
factor, Guthrie was 3 on the Rough side; while Houston and Peter, Paul, 
and Mary were 8 and 11, respectively, on the Smooth side. On the third factor, 
Guthrie scored zero; Houston was 9 on the Deep side; while Peter, Paul, 
and Mary obtained a similar score. 

In another comparison, subjects were asked to listen to a song, then rate 
it. After they had made the rating, they were told something about the song 
and the singer. The song was then played again and the subjects, whose first 
ratings had been gathered up, made a second rating. This was done for two 
groups, Group A and the Faculty Wives. The mean value on 12 scales changed 
from the first to the second rating for Faculty Wives, as opposed to only 
five changes in Group A. Of the 17 total changes, all but four were toward 
a higher score on the Good, Smooth, and Deep ends of the dimensions, with 
nearly all the favorable changes being made by Faculty Wives, which perhaps 
suggests that they are more apt than students to be influenced by a label 07 
name. This will be better understood when it is noted that the song was * 
rather hairy version of "Sam Hall," sung by Carl Sandburg (Lyrichord 66). 

Nobody recognized Bing Crosby in his folksinger role, but his presentation 
was judged to be very honest, deep, and sensitive, as indicated in Figure 2. 
Some of the subjects thought Harry Jackson's a cappella and lento version of 
a night-herding song to be insensitive, but that is the way this artist and one- 
time Wyoming cowboy sang it on the range. 

In Figure 2, in order for a profile circle to be in either a left- or right-hand 
extreme degree, more than half the subjects had to check that extreme. Only 
three such median scores were found in the entire survey. Two were given 
Ed McCurdy, who seems unique in his ability to project an impression 0 
potency, The third extreme median was for the good performance of Judy 
Mayhan; no other singer judged by any group was accorded this accolade. 

The correlation matrix showed that among the lowest correlations was 


тш 
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LES FOR THREE SoNGs ON EACH OF THE THREE FACTORS, 
Groups А AND В UsiNG MEDIAN AVERAGES 

Songs plotted on Factor I are 3: “Swannanoa Tunnel" by Erik Darling (Elektra 154), 
15: “Rocking the Cradle" by Judy Mayhan (Horizon 1608), and 54: "Lord, I Just 
Can't Keep from Crying" by Blind Willie Johnson (Folkways 2802). Songs plotted 
on Factor II are 4: “Clinch Mountain" by Ed McCurdy (Elektra 205), 16: “My 
Rambling Boy” by Pete Seeger (Vanguard 9161), and 55: “There Was a Friar in 
Our Town” by Richard Dyer-Bennet (Dyer-Bennet 1601). Songs plotted for Factor Ш 
are 11: “Reek and Rambling Blade” by Fleming Brown (Folk-Legacy 4), 51: “Hang 
Me, Oh, Hang Me” by Bing Crosby (Victor ГОР 6070), and 52: «І Ride an Old 


Paint” by Harry Jackson (Folkways 5723). 


SEMANTIC DIFFERENTIAL PROFI 
OBTAINED FROM 
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Good-Bad with Masculine-Feminine, Deep-Shallow with Relaxed-Tense, and 
Unsociable-Sociable with Simple-Complex, statistically implying that the Deep- 
Shallow scale, for example, tested quite a different set of values from the 
Relaxed-Tense scale. Among the highest correlations was Rugged-Delicate 
with Rough-Smooth, in this case statistically implying that one of these scales 
could be dropped from the differential, since they measure the same set of 
values, Good-Bad correlated highly with either Worth Hearing Again-Not 
Worth Hearing Again or with Performance Good-Performance Bad, but the 
correlation for the latter two scales was much lower, suggesting they should 
be retained in the differential. 


D. Discussion 


It would be difficult to avoid a suspicion that the values of many college 
students, including most of the subjects in these experiments who considered 
themselves knowledgeable in the genera, are quite different from the values of 
folkmusic collectors and critics. Many of our subjects were puzzled by Blind 
Willie Johnson’s “Dark Was the Night,” of which Harold Courlander has 
said, “In listening to Johnson’s ‘Dark Was the Night’ one is not immediately 
aware that he is hearing an expression of religious feeling. Marked by hum- 
ming, moaning, foggy voice, and tones resembling those of the accompanying 
guitar, the rendition fulfills all the requirements of blues singing at its best” 
(2). A similar reaction was elicited by Johnson’s “Lord, I Can’t Keep from 
Crying,” which was and is an important element in the development of folk- 
music and modern jazz (1). 

Nor did the students care for the singing of Woody Guthrie, who is not 
only a major figure in the history of recorded folksong (5), but was for 
many years a significant author in the protest song movement (4), with which 
modern college students do appear to identify. 

The most surprising finding was a practically universal liking for Cis 
Houston, who was not in any sense a trained singer. Perhaps the reason for 
this has been well adumbrated by Nat Hentoff when he said that Cisco 
Houston did not need to go to the people for his songs: he was one of 
people (6). In retrospect, one concludes that with effective promotion of the 
type that characterized the rise of the Peter, Paul, and Mary trio, Houston 
who was Guthrie’s protégé, could have been a successful singer instead of the 
wandering salesman who was hard put to find occasions when his singing 
could be sold. "1 , 

In conclusion, the initial and simple aim of ascertaining if the semantic 
differential could be adapted to the study of the value of folksong seems Юю 
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have been furthered by the finding that different songs, different performers, 
different groups, and different sets of prior information all yield some differ- 
ences of opinion, A working assumption of this survey was that singers such 
as Joan Baez and groups such as Peter, Paul, and Mary have attained their 
prominence while leaving unchanged the popularity level of the more basic 
type of American folksong typified by the singing of Woody Guthrie, Frank 
Proffitt, Alan Lomax, Leadbelly, John Jacob Niles, Obray Ramsey, Jean 
Ritchie, and Frank Warner; the country blues singers such as Son House, 
Blind Lemon Jefferson, Blind Willie Johnson, Furry Lewis, John Hurt, and 
Big Bill Broonzy ; the interpretive singers such as Jack Elliott, Sandy Paton, 
Sam Hinton, John Greenway, Ed McCurdy, Richard Dyer-Bennet, and 
Cisco Houston; and such relatively unknown but potentially moving singers 
as Judy Mayhan, Phil Campos, Gordon Heath, Walt Robertson, Robin 
Roberts, and Will Holt. We found nothing in this study to overthrow this 
assumption. 
Е. SuMMARY 

In this paper the use of the semantic differential to collect opinions from 
college students on the value they give to selected folksongs was described. 
Factor analysis was performed on some of the data and three factors with high 


scale loadings were extracted. A value space was plotted for some songs rated 
ofiles given in some other instances. The 


under different conditions, and pri 
tentative conclusion is that performed folksong can be aesthetically evaluated 


in a meaningful way with the requisite psychological tools. 
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A. INTRODUCTION 


Aronson and Carlsmith (1),1 in an experiment that has received considerable 
attention (2, 3, 4, 5, 6), demonstrated that Ss will alter previous responses 
when these are dissonant with prior expectations, even when such alteration 
results in poorer objective performance. Attempted replications have yielded 
very mixed results. It will be the purpose of the present paper to point out 
inadequacies in the type of analysis pursued by Aronson, as well as other 
students who have replicated the experiment. These inadequacies cast doubt, 
not only upon the original Aronson finding, but also upon most replications. 
Further, two experiments will be reported that examine aspects of the dis- 
sonance-performance alteration phenomena. 


B. THE Aronson DESIGN 


The design originally employed by Aronson has been followed in its essential 
details in all attempted replications.? 

The experimental procedure involved a 2 X 2 design in which high and 
low expectancies were combined with high and low performance treatments 
(HH, HL, LH, and LL).? S was presented an experimental task and assured 
that it represented a well standardized “test of social sensitivity.” Ё informed 
$ that some people did very well on the “test,” getting as many as 85 per 
cent of the judgments right; others as few as 20 per cent correct, 

The task consisted of five blocks of 20 cards. Each card contained three 
Pictures, one of which was purported to be that of a “schizophrenic.” It was 
$$ task to pick out the schizophrenic. There were, of course, no right answers; 
all values reported to 8 represented experimental manipulations. 


After each set of 20 cards had been completed, $ was informed of the number 
ae 
* Received in the Editorial Office, Provincetown, Massachusetts, on May 31, 1966. 
Copyright, 1967, by The Journal Press. 
Hereafter abbreviated Aronson, 
Several studies reported by Brock, e£ al. (2) are exceptions. This paper will 
Testrict itself to Brock’s studies that utilized the Aronson design. 


its he first letter refers to the expectancy level and the second to the performance 
evel, 
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of “right” responses she had obtained. After each of the first four blocks of 
20 cards, $ was told cither that she had obtained, respectively, 17, 16, 16, 
and 17 right (high expectancy inductions) or that she had 5, 4, 4, and 5 right 
(low expectancy). After the fifth block of 20 cards, $ again obtained а score, 
either 17 or 5, that represented her fifth trial or so-called “performance” 
score. 5 obtained this value by scoring the sheet upon which E had “recorded” 
8's responses. E had, of course, “recorded” $'s responses in such a manner as 
to determine the score $ would obtain when correcting the sheets herself. $% 
actual responses were also recorded for this fifth trial. 

Before $ had scored the sheet purporting to contain her fifth trial responses, 
E pretended to have made an error ; namely, $ was informed that E was to 
have timed her last trial responses. After ruminating about what to do, E 
informed S that she would just have to take the fifth section of the test over. 
Specifically, S was asked “Why don't you just pretend that it's a completely 
new set of pictures: that is, respond as if you've never scen the pictures 
before . . . .” The number of responses which $ changed on the repeated 
fifth trial represented performance alteration, the dependent variable. Obvi- 
ously, any changed responses maximized §’s chance of changing her prior pet 
formance and, therefore, reflected her satisfaction with that performance. 

Within this design, two motivational variables—achievement and dissonance 
—are manipulated. An achievement analysis of the treatments in this experi- 
ment would predict a main effect due to original fifth trial scores. Ss’ doing 
poorly on the fifth trial (HL, LL) should change their original response 
when they are given the opportunity. A dissonance analysis, however, would 
predict that Ss whose performances did not follow from their expectancies 
(LH, HL) would alter these performances. "This latter prediction is, of 
course, of greater interest, specifically as it relates to the LH group. Ss in 
this treatment should deliberately alter objectively good performances becaust 
they are dissonant with (low) expectations. 

Aronson’s data indicated a significant main effect due to fifth trial scores: 
that is, Ss did show achievement motivation. However, he also concluded 
in favor of the dissonance hypothesis. The investigator reasoned that two 
relationships were required in order to show the efficacy of dissonance moti- 
vation. First, the LH group must show more change than the LL group and, 
secondly, there must be a significant interaction term: that is, HL + LH must 
be larger than HH + LL. These relationships were obtained in the data. 
present paper argues that both of these criteria are fallacious tests of the dis- 
sonance hypothesis. In order to expedite this argument, let us consider the 
treatments within the Aronson design. 
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he HH condition may be taken as a baseline or control. Neither S's good 
orm nces nor the balance between expectations and ensuing performances 
should motivate changed fifth trial responses. 

- The HL condition is the exact opposite in that fifth trial performances are 
only poor, but also dissonant with expectations. Both achievement and 
dissonance motivation are acting. 

"The LL condition should show change due to achievement motivation caused 
‘simply by poor fifth trial performance. 

The remaining LH condition is, in fact, the only crucial condition for the 
dissonance hypothesis. In this condition, changed responses would be attribut- 
ble solely to dissonance. 

Aronson's criteria for assaying the role of dissonance in response alteration 
“may now be considered. The first criterion, involving a comparison of the LH 
to the LL condition, is not a comparison of a dissonance with a nondissonance 
condition. Rather, it is a comparison of a dissonance with an achievement con- 
dition. It may be considered an estimate of the relative magnitudes of the 
dissonance and achievement effects, but in the presence of insignificant differ- 
ences, one is not free to reject the dissonance hypothesis. 

The second and crucial criterion's validity rests upon assumptions that are 
not immediately obvious. The interaction test is valid only if dissonance and 
achievement motivations summate in the HL condition, The interaction term 
(HL + LH = HH + LL) as a test of the dissonance hypothesis requires 
that achievement motivation occur equally on both sides of this equation in 
order to allow the effects of achievement to be partialed out. If the two 
sources of motivation fail to summate in the HL group—that is, if they inter- 
act—an estimate of the dissonance effect is necessarily contaminated by failure 
to partial out effects due to achievement motivation. In order to employ 
Aronson’s interaction test of the dissonance hypothesis it must be shown that 
"the two sources of motivation do, in fact, summate. Should this be the case, 
the expected value for performance alteration in the HL group would be 
given by the equation HL — HH + (LL — HH) + (LH — HH). That 
is, change in the HL group must equal the base rate of change plus both pure 
"achievement and dissonance effects. 


_4 As is the case in all attempted replications excepting only Brock, et al. (2). 
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This study yielded an insignificant interaction term, the only reliable result 
being a main effect due to performance. Therefore, with one exception, 
decisions vis-a-vis the role of dissonance in performance alteration based upon 
Aronson’s criteria are necessarily inconclusive. Examination of Table 1 clearly 
suggests that both positive and negative findings relevant to the dissonance 
hypothesis may well have been produced artifactually. 


TABLE 1 
EXPECTED AND OBSERVED CHANGES IN FirrH-TRIAL Responses FOR HL 
TREATMENTS IN PERFORMANCE ALTERATION STUDIES 


Study HL expected HL observed me 

Aronson © Carlsmith (1)a 13 11.1 2.96% 
Ward & Sandvold (5) 7.6 8.1 

Cottrell (3) 9.38 3.25 10.22 
Waterman & Ford (6) 11.5 5.2 7.68 
Lowin & Epstein (4) 11.6 77 494 
Brock, et al. (2)4—Ia 8.2 5.3 +57 

—п 6 2:7 3.54 


** Unless otherwise indicated, all ¢ values in the table are significant by two-tailed 
tests with probabilities of .01. 


^ Heterogeneous variances. 
b p < .02. 


© t-test performed by assuming nine Ss per group: that is, a conservative test in that 
two groups actually contained 10 Ss. 


4 The first two of seven studies reported in the paper. 


An alternative method of analysis is available. This procedure involves à 
comparison of all three experimental groups (HL, LH, LL) to the baseline 
(HH) condition. All three groups should evidence significantly greater per- 
formance alteration than the HH condition. 'The only crucial comparison 
for the dissonance hypothesis is between the LH and HH groups. If the LH 
group evidences greater performance alteration than the baseline condition, 
the dissonance hypothesis would be supported. Two separate experiments 


designed to investigate aspects of the dissonance-performance alteration prob- 
lem are reported below. 


C. ExPERIMENT 15 


The first experiment is a replication o£ the essential features of the Aronson 
design. Several minor changes were, however, introduced to insure the attention 
of the S, as well as for purposes of procedural convenience.9 These modifica- 


5 Gratitude is expressed to Jon Bunce, wh 
C at a G » Who collected these data. 


A 4 » Рр. 296-297) discuss several minor modifications of the original 
Tonson design employed in their first two studies, as well as in other attempted 
replications. The evidence indicates that such slight modifications do not affect findings. 
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have been experimentally manipulated in previous studies and it is clear 
t they do not significantly alter the respective findings. For example, the 
portrayed himself to the $ in the role of a physician: that is, experimenter 


‹ 's score to him rather than having the S score а rigged protocol. As in prior 
dies, after the $ had completed the original fifth trial, Aronson’s ruse 
employed to justify having the $ repeat this segment of the task. 


D. RESULTS or EXPERIMENT 1 


Mean number of altered fifth trial responses for the four treatments were 
5.1, 7.2, 7.4, and 7.4 for the HH, HL, LL, and LH groups respectively. 
The expected (9.7) and observed (7.2) values for performance alteration in 
the HL group differ significantly (p < .01). As was the case in previous in- 
vestigations, this finding voids use of Aronson’s interaction test of the disso- 
nance hypothesis. Therefore, all three experimental treatment means were 
compared with the baseline HH condition by means of Dunnett's test (7). 
These analyses indicated significantly greater response alteration in both the 
LL and LH conditions. Differences between HL and HH conditions were 
not significant. Collectively, these separate tests represent a summary decision 
with a probability of a Type 1 error not greater than .05. Greater response 
alteration in the LL condition may be attributed to Ss desires to improve what 
they took to be objectively poor performances. Similar changed responses in the 
LH condition must be interpreted as support for the dissonance hypothesis. 


E. EXPERIMENT 27 


Prior studies (2, 4, 6) of response alteration utilizing the Aronson experi- 
mental paradigm have attempted to show that Ss in the various treatments 
evidence differential recall. Since none of these studies has reported a sig- 
nificant expectancy X performance interaction, their conclusions are not subject 
to the criticisms directed at the interaction test. Rather than evidencing a 
significant interaction, all three studies show a main effect due to Ss’ expec- 
tancies. In these studies $ was not told to pretend he had never seen the cards 
before, but rather was instructed to recall as many of his original fifth trial 
responses as he could. Ss with low expectancies recalled fewer original re- 
‘sponses in all three investigations. 


oa 


7 Gratitude is expressed to Louis Keefer, who collected these data. 
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The second experiment was a replication of the first with the memory set 
substituted for Aronson’s ruse. Ss were instructed to recall as many fifth 
trial responses as possible.® 

The mean number of errors in recall of responses made on the fifth trial 
are 4.3, 5.5, 5.6, and 4.6 for the HH, HL, LL, and LH groups respectively. 
An analysis of variance of these data appears in Table 2. It may be noted 
that expected (5.9) and observed (5.5) values for the HL group do not 
differ significantly: that is, in this experiment the assumption that dissonance 
and achievement summate in the HL condition seems tenable. The only 
finding approaching significance is a main effect due to performance: that is, 
$s who do poorly on the original fifth trial tend to recall fewer of these 
responses. 


TABLE 2 
ANALYSIS ОЕ VARIANCE, RECALL DATA 
Source df Ms F 
A (expectancy) f .8 
B (performance) 1 24.2 349* 
AXB 1 2 
Error 76 6.93 


*.05 << p < 10. 


F. Summary 


Prior investigations of the role of both achievement and dissonance moti- 
vation as factors in response alteration have employed Aronson’s interaction 
test as the principal test of the dissonance hypothesis. This test of the hypothe- 
sis is valid only if dissonance and achievement summate in the HL condition. 
A reanalysis of seven relevant studies indicated that this assumption was not 
at all tenable in six of the studies. Therefore, a simpler test of the dissonance 
hypothesis—a direct comparison of the LH and HH treatments—is required. 
A replication of Aronson’s experiment employing this statistical test of the 
dissonance hypothesis indicated that Ss in the dissonance (LH) condition did 
change significantly more fifth trial responses than the baseline (HH) condi- 
tion.? Achievement motivation was also a significant determinant of response 
alteration (LL > HH), a result found in most prior studies. 

The second experiment indicated that Ss who performed poorly on the 
original fifth trial recalled fewer of their responses, This finding, of course, 


д Additionally, E did not assume the role of a physician. 
Cottrell (3) also reported, as an incidental analysis, the finding that HH, LH, 
and LL groups showed more performance alteration than the HH group. 
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also lends itself to a straightforward dissonance interpretation. Prior studies 
had indicated that Ss who expected to do poorly recalled fewer responses, a 
result not supported in the present data. Interestingly, neither the present, 
nor prior studies support the hypothesis that performance dissonant with 
expectancy produces inferior recall: that is, a significant interaction term. 
While this failure might be attributed to the interaction of dissonance and 
achievement in the HL conditions in prior studies, this was not the case in 
the present replication. Apparently, performance dissonant with expectation 
does not affect recall. Rather, performance (low) dissonant with Ss’ own 
conception of their personal competence (assuming this to be high) serves 
to attenuate recall. 
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LAWRENCE ANTLER, HERBERT Н. ZARETSKY, AND WALTER RITTER? 


A. INTRODUCTION 


A considerable amount of research in recent years has investigated the rela- 
tionship between the Gordon Personal Profile and various measures of success 
(1, 2, 4). Generally speaking, acceptable correlations up to .62 have been 
obtained between this measure of personality, on the one hand, and trait 
ratings and performance criteria, on the other. The samples for which the 
practical validity of the Gordon Personal Profile have been assessed were 
generally drawn from the labor force in the fields of commerce and industry. 

The purpose of the present study was to assess the ability of the Gordon 
Personal Profile scales to predict adjustment and performance for a sample 
of medical residents. "These residents were in postgraduate medical training 
programs in various specialties in the United States. The Gordon Personal 
Profile was selected for administration to these groups, since the personality 
traits measured by the scales—e.g., Ascendancy, Responsibility, Emotional 
Stability, and Sociability—appear to be related to the commonly held stereo- 
type of the United States physician. The results of a previous report (7) 
indicated that both cultural and vocational factors were related to the self- 
percept of the physician. The foreign physician was included in the present 
study to compare the practical validity of the Gordon Personal Profile for 
United States and foreign physicians. The use of the Gordon Personal Profile 
on a foreign sample seemed feasible in the present case. The Ss were in the 
identical profession, were English-speaking, and were working in United 


* Received in the Editorial Office, Provincetown, Massachusetts, on June 3, 1967. 
Copyright, 1967, by The Journal Press. 

1 "This investigation was supported, in part, by a research grant, number RD-819-M, 
from the Vocational Rehabilitation Administration, Department of Health, Education, 
and Welfare, Washington, D. C. 20201. The authors wish to express their appre- 
ciation to Dr. Howard A. Rusk, Project Director; Dr. Michael M. Dacso, Coproject 
Coordinator; to Dr. Jacob L. Halberstam, Coproject Coordinator, their special appre- 
ciation for his editorial comments; and to the other members of the staff who assisted 
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States hospitals and living in United States cities. Thus, the present stu 4 
seemed to circumvent some of the untenable assumptions concerning 
equivalence of meaning, stimulus, and response that have plagued cros 
cultural comparisons (5). 
B. METHOD 
1. Subjects 


"The Ss were selected from a total sample of 281 Ss consisting of 139 Unit 
States and 142 foreign male residents in university-affiliated hospital те ide 
training programs located primarily in the Northeastern states of the U 
States. It should be pointed out that these Ss were the same groups sti 
by Marsh and Halberstam (7). Approximately 10 per cent of the 
sample of Ss were located in the Midwest and the West. The United 
residents, with a mean age of 30.9 years (SD = 4.9), were somewhat 
than their foreign counterparts, whose mean age was 33.1 years (SD = 
Of the United States residents, 50 were in surgery, 47 in internal mec 
and 42 in physical medicine and rehabilitation (PM&R). Of the 
residents, 49 were in surgery, 36 in internal medicine, and 57 in PM&R. 
foreign residents! home countries were tabulated according to the 
regions from which they came, following the categorization of home cou 
by the Institute of International Education (6): 35 residents came 
the Far East, 30 from the Near and Middle East, 41 from Europe, 34 fro 
Latin America, and two from Africa. All of them had completed their b 
and medical education abroad. The United States residents were born 
educated in the United States. Of the United States group, 46 were in t 
first year of residency, 39 in their second year, 39 in their third year, 
in their fourth year, and four and two in their fifth and sixth years, 
tively. Of the foreign residents, 46 were in their first year, 43 in their 
year, 29 in their third year, 16 in their fourth year, and four each in 
fifth and sixth years. 

The Ss included in this study were the 15 highest-scoring and 15 lo 
scoring Ss on each scale of the Gordon Personal Profile. According to Go 
(4), the validity coefficients of the scales of the Gordon Personal P 
represent linear relationships, “. . . but it may very well be that with n 
types of performance criteria linearity of relationship is the exception 
than the rule. The reported validities, therefore, may often be underestimal 
of the true relationship” (4, p. 17). On this basis, it seemed advisab 
select the extreme groups for the analysis of the data. Analyses of each 
the four Gordon scales were then made of the extreme groups selected frc 
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the foreign residents, Similar analyses were separately made for the United 
States residents. The samples for each of the Gordon scales were not entirely 
independent, since some overlap was found. 


: 2. Instruments 


A number of new instruments were specially designed or adapted for use 
in this study. These included a background information questionnaire and 
measures of adjustment and performance. The criterion measures were taken 
from the following scales: 


1. Contacts with United States, Home Country, and Foreign People 
General Satisfaction—United States 

Academic Satisfaction 

Satisfaction with Supervision 

Satisfaction with Present Hospital 

Satisfaction with Training 

Overall Program Evaluation 

Expectations 

. Doctors Rating 

10. Residents’ Performance Rating Form 


Kuder-Richardson reliability coefficients of .75 to .97 were obtained for 
the new instruments analyzed in this study. Each resident's supervisor was 
asked to evaluate the resident's performance on a specially designed rating 
form which included such variables as knowledge of basic sciences and clinical 
medicine; knowledge of the English language, both written and oral; person- 
ality characteristics; and overall performance. The Kuder-Richardson reli- 
ability coefficients for the two factors extracted from the residents’ perfor- 
mance rating form—a verbal factor and a performance factor—were .90 and 
.96, respectively. 


ю юмо Ф о 


3. Procedure 


The Ss were randomly selected by the investigators from lists of residents 
supplied by chairmen of the medical departments and chiefs of services in 
university-affiliated teaching hospitals. The instruments were administered 
by trained field-work interviewers, most of whom were graduate students in 
psychology. The interviewers were thoroughly briefed in the procedure and 
intent of the research instruments. Cooperation from departmental heads and 
residents was outstanding, with a refusal rate of less than 1 per cent. 

Among the low-scoring foreign Ss, the average range of the scores on the 
Ascendancy, Responsibility, Emotional Stability, and Sociability scales was 
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7.8 to 15.3; for the high-scoring foreign Ss, the average was 29.3 to 
Among the low-scoring United States Ss, the average range was 7.5 to H 
for high scorers, the average range was 30.5 to 33.5. 


C. RrsuLTS 


l. Foreign Residents 


Within the foreign group, a comparison between the extreme scorers 
each scale of the Gordon Personal Profile with respect to the criterion m 
sures of adjustment and performance indicated that thc Ascendancy $ 
discriminated scores on only one of the 13 criterion measures: c.g., Conti 
with United States People. The observed means were 18.9 (SD — 41) f 
the high scorers and 13.6 (SD = 2.0) for the low scorers. The ¢ ratio! 
the difference between the observed means was 3.41 (p < .01), indicat 
that those Ss who reported themselves as highly ascendant also reported m 
contacts with United States people. 

The Responsibility scale discriminated only the amount of satisfaction ! 
supervision. High scorers reported significantly more satisfaction with t 
supervision than did low scorers. The means were 32.1 (SD = 5.1) @ 
27.1 (SD = 4.7), respectively. The ¢ ratio of the difference between t 
observed means was 2.66 (p < .05). 

Table 1 shows that the Emotional Stability scale yielded 11 signific 
comparisons out of the 13 that were made. Thus, those Ss who repon 
themselves as highly stable emotionally also reported more satisfaction Wi 
various aspects of the training programs and their United States sojou 
experiences. On the Doctors Rating Scale, which is reverse scored, 
rated United States physicians superior to foreign physicians as practi 
of the same profession. However, it was the low scorers on the Em 
Stability scale who tended to rate the United States physicians even 
They also reported more realistic expectations about working in the Ur 
States compared to the low scorers. The high scorers on the Emoti 
Stability scale were also rated higher by their supervisors in their 
with the English language and in their performance. А 

In order to determine whether or not the Emotional Stability scale ¢ 
criminated on other measures of personality, the performance of the ext 
scorers was compared on the Ascendancy, Responsibility, and Sociability s 
"Table 2 shows that the high scorers on the Emotional Stability scale obti 
significantly higher scores on the Ascendancy and Responsibility measures; © 
no difference was observed between the means for the Sociability scale. _ 

Finally, the Sociability scale discriminated between high-scoring and k 
scoring residents on only one of the criterion measures. Ss who reported i 
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TABLE 1 
Meaw Scores, SrAxDAxD Deviations, АХО ¢ Ramios or Hion лхо Low Scones 
ON THE EMOTIONAL STABILITY Scag or тик Сокоох PrxsowAL Promina 


High scorers Low scorers 
(N = 15) (М = 15) 
Criteria Mean SD Mean 5D t 
Adjustment 
Contacts with 
United States People 1747 $39 1607 5.10 2.92% 
Home Cou! People 12480 4.66 1535. 513 oo 
Foreign e 1340 5.08 1427 27.56 NS 
General Satisfaction—U. 8, 19.13 406 19.66 447 NS 
Academic Satisfaction 2147 3.18 1840 147 4 09e 
Satisfaction with Supervision 2900 5.25 2627 377 247* 
Satisfaction with Present 
Hospital 23.80 322 2127 425 3.73•• 
Satisfaction with Training 40.80 5.13 36.67 $44 10.07% 
Overall Program Evaluation 12.53 1.39 9.67 249 6.50% 
Expectations $27 1.09 767 1M 4.10** 
Doctors Rating 2600 400 23.07 456 4.60** 
Performance 
Supervisors’ Rating of English $47 102 7.33 72 5.35% 
Supervisors’ Rating of 
erformance $5.20 12.01 6947 15.20 3.75% 
op < OS. 
** $ c. 01. 
TABLE 2 


MEAN Scores, STANDARD DzviATIONS, AND ¢ Ratios ОР Ніси лхо Low SCORERS 
ON THE EMOTIONAL STABILITY SCALE OF THE GORDON PERSONAL PROFILE 


High scorers Low scorers 

Gordon Personal Profile (N z 15) (N = 15) 
scales Mean SD Mean SD t 
Ascendancy 22.00 2.37 1747 5.60 3.15* 
Responsibility 29.07 2.57 19.07 3.59 7.94* 
Sociability 18.53 348 1.80 9.44 NS 


*p« л. 


great amount of sociability in their self-percept also reported significantly 
more contacts with United States people than did the low scorers. The high 
scorers obtained a mean of 17.5 (SD — 4.3) on the Contacts with United 
States People scale; the low scorers obtained a mean of 12.9 (SD — 1.7). 
The ¢ ratio for the mean difference was 4.63 ($ < .01). 


2. United States Residents 


As far as the United States residents were concerned, the high and low 
scorers on the Gordon Personal Profile scales did not differ significantly in 
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the scores they obtained on the measures of adjustment and performance by 
2 test. Only one comparison in 48 was significant. 

'To check on the possibility that cross-cultural differences between United 
States and foreign residents could account for the results that were obtained, 
a number of additional analyses were made. Comparisons on the criterion 
variables were made between (a) high-scoring foreign and United States $5 
and (5) low-scoring foreign and United States Ss. No systematic differences 
were observed between foreign and United States Ss. Among the total sample 
of 170 foreign physicians (including females), no relationship between the 
length of time in the United States and scores on each scale of the Gordon 
Personal Profile was found. Furthermore, facility with the English language, 
as rated by each foreign resident’s supervisor, was not related to scores on 
the Ascendancy, Responsibility, and Sociability scales. A low r of 16 (2 < 
05) was found for the Emotional Stability scale. 


D. Discussion 


It is apparent from the results of this investigation that the practical validity 
of the Gordon Personal Profile was obtained only for the Emotional Stability 
scale and only within the foreign sample. An analysis of the background 
information obtained from the high- and low-scoring foreign Ss on the Emo- 
tional Stability scale indicated features that may be logically associated with 
emotionally stable persons. Thus, the high scorers tended (a) to be married 
or had been married; (b) to claim a religious affiliation; (c) to report more 
frequent attendance at religious services in their home countries; and (4 ) to 
have participated in more club activity before entering medical school, as 
compared to the low scorers. In addition, 53 per cent of the high scorers 
reported that their specialty was accorded more respect than other specialties 
in their home countries. Among the low scorers, only 21 per cent reported 
that their specialty was accorded more respect. 

The indirect evidence that was found in this study for the characteristics 
of emotionally stable persons also appears to be related to Gordon's descrip- 
tion of the meaning of high and low scores on the Ascendancy and Respon- 
sibility scales (4, p. 3). 

Insofar as the foreign sample was singled out for consideration, it would 
appear that the Emotional Stability scale had merit in discriminating adjust- 
ment and performance. When the results for the foreign and United States 
samples are considered, however, situational factors may explain the dis- 
criminating power of the Emotional Stability scale in the present case. The 
adjustment problems of the foreign exchange student in the United States 
have been well documented, as a recent survey of the literature indicates (3). 
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It would appear that the Emotional Stability scale was most sensitive to the 
adjustments made by a group of persons who were in a situation quite at 
variance with that which they had known all their lives and which required 
an emotionally stable personality in order to cope successfully with the stresses 
of a new environment. In contrast, the United States physicians, remaining 
in their homeland as they did, did not have to adapt to a new environment. 
In the latter instance, the present authors conclude that the Emotional Stability 
scale has merit in the assessment of adjustment and performance of foreign 
graduate medical exchangees in the United States. 


Е. SUMMARY 


Two hundred eighty-one United States and foreign medical residents in 
training in university-affiliated hospitals were administered the Gordon Per- 
sonal Profile, and the 15 highest and 15 lowest scorers on each of the four 
scales were selected for inclusion in the study. The results indicated that, 
among the foreign group of residents, the extreme scorers on the Emotional 
Stability scale were strongly differentiated with respect to criterion measures 
of adjustment and performance. Moreover, high scorers on the Emotional 
Stability scale obtained significantly higher scores on the Ascendancy and 
Responsibility scales of the Gordon Personal Profile. Similar comparisons 
among the United States group of residents were generally inconclusive. 
Support of the practical validity of the Gordon Personal Profile was dis- 
cussed, as well as the relationship of the present results to the assessment 
of adjustment among foreign medical exchangees training in the United States. 
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ABILITY TO DIFFERENTIATE MESSAGE FROM SOURCE 
AS A CURVILINEAR FUNCTION OF SCORES ON 
ROKEACH'S DOGMATISM SCALE* ! 


Department of Psychology, University of Manitoba 


GILBERT BECKER 


A. INTRODUCTION 


According to Rokeach (17), open-mindedness can be conceptualized in terms 
of the extent to which information and the source of such information are 
independently evaluated. An individual is open-minded insofar as he “can 
receive, evaluate, and act on relevant information received from the outside 
on its own intrinsic merits ....” (17, p. 57). Empirical findings consistent 
with this conceptualization have appeared in two studies. Vidulich and Kaiman 
(19) have reported that open-minded female Ss (low scorers on Rokeach’s 
Dogmatism scale) were better able to judge the directional “movement” in 
an autokinetic situation independently of the judgment of a high-status con- 
federate. Powell (16), using a different class of information, has found that 
open-minded male and female Ss (low scorers on Rokeach's Dogmatism scale) 
were better able to evaluate political statements attributed to the two major 
candidates in the 1960 United States presidential election independently of 
the evaluation of the candidates. Although the data were available, Powell's 
analysis did not include subject-sex comparisons. 

In both of the above-mentioned studies it is difficult to infer the nature of 
the function obtaining between Dogmatism scores and dependency of evalua- 
tion of information because only two levels of Dogmatism scores were used 
in the comparison of the experimental results. Although a curvilinear func- 
tion can just as easily be fitted to two points as a rectilinear function, it was 
tacitly assumed in both studies (through the omission to indicate other possi- 
bilities) that the functions were rectilinear. Although the concept of open- 
mindedness may be rectilinearly related to the ability to differentiate message 
from source, it must be remembered that we are not dealing with the concept 


* Received in the Editorial Office, Provincetown, Massachusetts, on June 3, 1966. 
Copyright, 1967, by The Journal Press. 

1 Appreciation is expressed to the following students who helped in the collection 
and ordering of the data: Ignez Koptcke, Mary Lousteau, Richard Milam, and Joyce 
Kaplan, The facilities of the Tulane Bio-Medical Computer Center, sponsored by 
NIH Grant FR00006, were used in the statistical analysis of the data. 
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directly but through the mediation of responses to verbal stimuli, which are 
notoriously subject to a variety of forms of distortion (8, 9). 

The present study was designed with three purposes in mind. First, the 
author wanted to test the generality of Vidulich and Kaiman’s and Powell’s 
findings with respect to a new class of information and source: namely, 
humorous "messages" (jokes) and comedians to whom they are attributed. 
Second, the author wanted to test whether or not the function found in the 
new context is, in fact, rectilinear. Third, because the findings in too much 
research involving only one sex (or both sexes but without separate analyses) 
have been generalized (or particularized), many times with error, to both 
sexes, the author wanted to test whether or not sex-related differences exist. 


B. METHOD 
1. Subjects 


The population sampled consisted of predominantly first-year students en- 
rolled in various introductory psychology sections at the New Orleans campus 
of Louisiana State University. Three independent samples were selected at 
random from this population. Each sample was divided evenly with regard 
to sex. The two smaller samples were used to obtain normative data to be 
used in testing the main sample. 


2. Procedure in Collecting Normative Data 


Ss in the first sample (N = 50) were asked during class to write down the 
names of three comedians who were the most liked and three who were the 
least liked. The 51 names recorded were assumed to represent comedians 
known to the population being sampled and to reflect various degrees of 
popularity. 

Ss in the second sample (N = 80) were asked, in groups of from 10 to 20, 
to Q-sort into seven piles, from least liked to most liked, 67 jokes typed on 
individual cards in either one of two packets randomly created from 4 pool 
of 134 jokes. The jokes in each pile were assigned values from 1 through 7. 
Six jokes for use in the main part of the experiment were selected on the basis 
of (a) central location in the distribution of mean values, (4) low variability 
of individual values around the mean value, and (c) little or no sex differences- 
The mean value and range of the six jokes were, respectively, 4.00 and 45. 


3. Procedure in Collecting Differential Data 


Ss in the main sample (N = 150) were given, during a regular class sês- 
sion, a list of the names of the 51 comedians obtained earlier and asked to 
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select the one most fitting each of six descriptions ranging from “best-liked” 
to “least-liked” before appearing individually for the main part of the experi- 
ment. At that time $ was given the set of six jokes, one joke at a time in 
random order, and asked to read each one carefully. As each joke was handed 
out it was attributed to one of the six comedians previously selected by S, in 
fixed and staggered order of preference: that is, first, sixth, second, fifth, 
third, and fourth. After the last joke was read, $ was asked to rank-order the 
six jokes from “most funny” to “least funny.” 

The 40-item Dogmatism scale (Form E), devised by Rokeach (17) to 
measure open-mindedness, had been administered collectively to the main 
sample early in the semester and at least six weeks before the jokes were first 
presented for evaluation. 


4. Statistical Treatment 


The sum of the squared differences in rank order (Xd?) between the six 
comedians individually selected and the six jokes attributed to them was the 
score used to indicate the ability to differentiate message from source. ‘This 
score ranged from zero to 70, and could be converted into a rank-difference 
correlation coefficient (rho) by multiplying it by the constant 6/N (N? — 1) 
(which in the present case is 1/35) and subtracting from unity. Mean scores 
for each group could be converted into a mean rank-difference correlation 
coefficient by the same procedure and by using the same constant, as indicated 
from the formulas developed by Taylor and Fong (18). Thus, a mean score 
of 35 would correspond to a mean rho of zero and indicate perfect ability to 
evaluate message and source independently. Scores higher than 35 would 
correspond to a negative coefficient and scores lower than 35, to a positive 
coefficient. 

The 75 male and 75 female Ss were trichotomized into equal subgroups of 
25 depending on their Dogmatism score, and characterized as members of the 
low, medium, or high Dogmatism subgroup. A 3 X 2 factorial design, based 
on the Dogmatism trichotomy and the subject-sex dichotomy, was used in 
analyzing the variances. 


C. RESULTS 


Table 1 includes the mean X4? score for the entire sample, each subgroup, 
and each cross-comparison group. Figure 1 depicts the mean Sd? score as a 
function of Dogmatism-subgroup membership for each sex separately and com- 
bined. F tests indicated reliable main effects for both Dogmatism (F = 3.76, 
df = 2/144, p < .05) and Sex (F = 4.07, df = 1/144, p < .05). 
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TABLE 1 
Mean Xd? Scores 


Dogmatism Subgroup 


Subjects Low Medium High Total 
Male 26.00 37.88 28.16 30.68° 
Еета!е 24.56 28.72 21.68 24.998 

Total 25.2$5,4 33.304 24.9254 27.846 


* | between means of low and medium Dogmatism subgroups = 2.25, df = 9%, 
p < 05. 


b ¢ between means of high and medium Dogmatism subgroups = 2.41, df = 9% 
} < .02. 

© f between mean of entire sample and hypothetical mean of 35.00 (which cor- 
responds to a mean rho of zero) = 5.01, df = 149, p < .001 (15, p. 107). 1 

4 t between means of low, medium, and high Dogmatism subgroups, respectively, 


and hypothetical mean of 35.00 = 3.95, df = 49, p < .001, less than unity, and 441, 
df = 49, p < .001. 


* t between means of all males and all females, respectively, and hypothetical mean 
of 35.00 = 2.05, df = 74, p < .05 and 5.16, df = 74, p < .001. 


D. Discussion 


If the author assumes that the general ability to differentiate message from 
source has been tapped by his experimental operations, it follows that this 
general ability, an essential characteristic that defines open-mindedness ac- 
cording to Rokeach, cannot be predicted from scores on the Dogmatism scale 
in any simple fashion, as has previously been assumed. Rather, as the data 
indicate, the author would predict that Ss in the middle range of the distribu- 
tion of Dogmatism scores tend to be open-minded, and that Ss either below of 
above that range tend to be closed-minded. An interesting question is raised 
immediately: what differentiates closed-minded Ss scoring low on the Dog- 


matism scale from those scoring high? The best answer seems to involve the 
Tesponse set determined by the testing situation. 


1. Low-Scoring Dogmatics and Denial-Response Set 


The term denial-response set as used here refers to a general phenomenon 
associated with the need for social approval (10), the tendency to be insensitive 
to negative attributes of self (6; 10, pp. 130-132), and the tendency in gen- 
eral to use mechanisms of repression, denial, and reaction formation in response 
to threat (4). A look at a few of the statements on the Dogmatism scale 
suggests the ease with which Ss with a tendency to deny socially undesirable 
Statements, or those that reflect a negative self-image, might disagree with 
those statements: No. 7. Most people just don’t give a ‘damn’ for others; 
No. 9. It is only natural for a person to be rather fearful of the future; No. 19. 
There are a number of people I have come to hate because of the things they 
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stand for. 'These statements seem incompatible with the verbal behavior of 
the person whose social and personal image must reflect a concern for others, 
a certain degree of fearlessness concerning the future, and an inability to hate 
any of God's creatures," respectively. 

Two sets of data lend support to the above analysis. The first set of data 
was obtained from University of Texas students by Byrne and his associates. 
Byrne et al. (6) reported a correlation between scores on Byrne’s revised 
Repression-Sensitization? (R-S) scale and self-ideal (S-I) discrepancy scores 


2 A high score on the R-S scale indicates a tendency, in response to threat, to use 
sensitizing mechanisms such as intellectualization, isolation, and lowered recognition 
thresholds in contrast to avoidance responses, such as repression, denial, and reaction 
formation. 
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on Worchel’s (20) Self Activity Inventory (SAI) (r = .63). The relation- 
ship was found to be a function of negative self-description rather than of 
S-I discrepancy as such. This may be interpreted to mean that repressors tend 
to be insensitive to negative self-attributes. In an unpublished report cited by 
Byrne (4), scores on the Marlowe-Crowne Social Desirability (SD) scale 
(10) were found to be related to scores on the R-S scale (r — —.37). This 
is to say that repressors tend to give socially desirable responses and is con- 
sistent with Crowne and Marlowe's (10, pp. 97-185) thesis that approval- 
dependent persons tend to use avoidant, repressive defenses against hostility, 
anxiety, fear of rejection, etc. Finally, and more directly, Byrne and Goldberg 
(5) reported a correlation between scores on Rokeach’s Dogmatism scale and 
R-S scores (r = .36). 

The second set of data was obtained in a way more comparable to the 
present findings, Becker and DiLeo (2) administered the Dogmatism scale, 
the SD scale, and the SAI to male and female students who were subsequently 
divided into low, medium, and high Dogmatism subgroups. SD and S-I dis- 
crepancy scores in the three subgroups were then compared. Differences among 
the three SD means were not statistically reliable. Differences among the 
three S-I discrepancy means, however, approached significance and the ¢ ratio 
calculated between the means of the low and high Dogmatism subgroups indi- 
cated an acceptable level of reliability. 


2. High-Scoring Dogmatics and Agreeing-Response Set 


It must be noted that the Dogmatism scale contains only positively worded 
items, agreement with which presumably indicates Dogmatism. A scale of 
this kind is susceptible to a response set to agree (8, 9) and fails to distinguish 
between closed-minded individuals (as defined by the content of the scale) 
and high agreers. Couch and Keniston (7) have created a self-report scale 
which presumably measures the generalized tendency to agree with items 
regardless of content. This tendency was found to define partially a person- 
ality syndrome characterized essentially by “stimulus acceptance” (high 
agreers) vs. “stimulus rejection” (low agreers). Couch and Keniston have 
reported a correlation between scores on their scale and Dogmatism scores 
(ғ = 40). 

Lichtenstein, Quinn, and Hover (14) have found Dogmatism scores to 
relate to scores on two independent measures of agreeing-response set. The 
first measure was based on items taken from a measure of Impulsivity. The 
second measure was the total number of MMPI True responses. The two 
sets of scores, respectively, correlated .47 and .51 with scores on the Dogmatism 
scale. 
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3. Implications of the Response-Set Interpretation 


The response-set analysis has certain implications for the reinterpretation 
past findings in connection with the behavior of high and low scorers on the 

atism scale (and for that matter, on other scales that are similar in 
tent and structure). For example, let us consider some additional findings 
rted by Vidulich and Kaiman (19). Thirty high and 30 low scorers on 
Dogmatism scale were tested for differences in degree of conformity to 
high- and low-status sources of information on the directional “movement” in 
autokinetic situation. It was found, in one analysis, that high scorers 
tended to be influenced more by a high-status source than by a low-status 
source, whereas low scorers showed no differences and tended to score inter- 
mediately. In terms of our response-set analysis the above findings might be 
interpreted in the following way. High scorers, because of inadequate impulse 
control, tended to agree with the high-status source as they are known to do 
with opinionnaire items. For the same reason, high scorers tended to dis- 
agree with the low-status source; this time, the impulse was negative—one 
‘tending to ignore or even to counter information stemming from “inadequate” 
sources. Low scorers, on the other hand, because of their greater tendency to 
suppress, perhaps overcontrol spontancous impulses, tended not to follow their 
inclinations both to conform to the high-status source and to counterform to 
the low-status source, thus explaining their intermediate position. 

This implication can easily be tested. Ss varying on some experimental 
measure of open-mindedness, but matched on some scale of agrecing-response 
set (perhaps that of Couch and Keniston), could be tested for influencibility 
under high- and low-status confederate conditions. Ss varying on agreeing- 
response set but matched on open-mindedness could be similarly tested. The 
‘results would indicate whether the differential effects of status are due to 
‘differences in open-mindedness or “stimulus-rejection” or both. 


4. Sex Differences 


The sex-related differences found are not surprising in view of previous 
reports of greater social dependency in women. Females have been found to 
be more susceptible to persuasive communications (13), more socially affil- 
dative when experimentally made anxious and asked to wait their turn for the 
expected noxious event (12) or to participate in the noxious event (1), more 
likely to volunteer for experiments promising interaction with others (3), 
re conforming in a modified Asch situation (11), and to have a greater 
eed for social approval (10, pp. 209-212). Our finding merely adds to the 
ay of previous findings on the greater social dependency of women. 
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5. Methodological Considerations 


The importance of dividing samples into at least three groups has been 
demonstrated in this study. The use of median-determined groups or of two 
extreme-scoring groups would have resulted in the conclusion of either no 
relationship or a monotonic relationship (the latter being suggested among the 
female Ss). A symmetrical distribution of 24? scores (or other criterion 
scores) will produce the first, and a skewed distribution will produce the 
second. The methodological principle involved here is obvious and will be 
accepted by any competent researcher. 


E. SUMMARY AND CONCLUSIONS 


Undergraduates classified into low, medium, and high Dogmatism subgroups 
rank-ordered according to preference six sources (comedians) and six messages 
(jokes) arbitrarily attributed to the sources. Analysis of the squared rank- 
difference scores suggested the following preliminary conclusions: 

1. The evaluation of messages by undergraduates does not depend entirely 
upon the intrinsic merit (content) of those messages but additionally upon the 
evaluation of the sources presumed to be responsible for them. 

2. Female undergraduates, to a greater extent than do male undergraduates, 
tend to evaluate messages less on the basis of content and more on the basis 
of external factors (source). 

3. Undergraduates who can be classified as either Гош or high on the basis 
of Dogmatism scores tend to evaluate messages less on the basis of content 
and more on the basis of source than do those who can be classified as medium; 
therefore, and in accordance with Rokeach's conceptual definition of open 
mindedness, such undergraduates tend to be less open-minded. 

4. It follows from Conclusion 3 that the Dogmatism scale is invalid as 4 
rectilinear predictor of open-mindedness, though valid as curvilinear predictor 
thereof. А 

Response-set tendencies in connection with the Dogmatism scale were dis- 
cussed (a) as the basis for the invalidity of the Dogmatism scale as a recti- 
linear predictor of closed-mindedness, (b) as the basis for reinterpreting past 
findings involving the Dogmatism (and similar) scales, and (с) as possible 
correlates of experimentally-defined closed-mindedness. 
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А. PROBLEM 


Aronson and Carlsmith (1) have proposed that the disconfirmation of 
performance expectancies arouses cognitive dissonance. The adjustment of 
proficiency at the task would appear to be an effective means of reducing 
the dissonance. Thus, a person who performs well when expecting to do 
poorly would reduce dissonance by decreasing his task proficiency; a person 
who performs poorly when expecting to do well would reduce dissonance by 
increasing his task proficiency. 

Previous work (1, 3, 5) has shown that dissonance resulting from the 
disconfirmation of performance expectancies leads persons to change their 
judgments. The judgment-changes served to replace a dissonant score with one 
consistent with expectation. The judgment-changes, however, were not changes 
in task proficiency because the task of the previous studies had no accuracy 
criteria. In the previous studies the task was to pick the schizophrenic from sets 
of facial photographs. Actually, none of the pictured individuals was schizo- 
phrenic. Thus, it was impossible for subjects to become more proficient or less 
proficient at this spurious task. Another effect of expectancy-performance 
dissonance upon judgment was observed by Cottrell (5). After being allowed 
to change their judgments of the pictures, the subjects estimated their future 
proficiency at the task. These estimates showed dissonance-reduction effects. 
The dissonance subjects rejected the inconsistent feedback by failing to use 
it when they estimated their task proficiency. 

In all of the previous studies the dependent variables were judgmental 
responses. The question is still open as to whether persons adjust their task 
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proficiency in order to reduce dissonance between expected and obtained perm 
formance. The present study was conducted to determine whether the effect 
of expectancy-performance dissonance are confined to judgmental processe 
Subjects performed a simple reaction time task and experienced either con 
firmation or disconfirmation of their performance expectancies. Persons whost 
apparent performance was discrepant from their expected performance mi 


confirmed expectancy, or by rejecting the discrepant information when maki 
estimates of their future success. The present study manipulated strength 
expectancy, as well as the usual experimental variables, level of expected 
formance, and apparent level of final performance. Previous work (2, 3) 
shown that the stronger an expectancy, the greater the magnitude of dissonant 
aroused by its disconfirmation. Accordingly, in the present study it was pre 
dicted that the greatest dissonance-reduction effects would be obtained f 
persons experiencing the disconfirmation of a strong expectancy. 


B. METHOD 
1. Design 


There were three independent variables in a 2 X 2 X 2 factorial arrange 
ment. Six Ss were randomly assigned to each of the eight experimental condi- 
tions. The independent variables were level of performance expected on 
task (Expectancy), strength of expectancy, and apparent final performane 
on the task (Performance). In the customary version of the performano 
expectancy paradigm, one $ is run at a time. This procedure confounds 
particularities of a single experimental session with the combination of a 
perimental variables administered during that session. Cottrell (5) has @Ҹ 
cussed the ways in which this confounding can produce equivocal and mis 
leading results. The present study used a split-plot design to control for 
session-to-session variations in procedure. The Ss were run in groups of four. 
In each session there occurred all four combinations of the Expectancy 4 
Performance variables, The High Expectancy-Low Performance and the 
Expectancy-High Performance conditions were the dissonance conditions. 4f 
High Expectancy-High Performance and the Low Expectancy-Low Perfor: 
mance conditions were the consistent control conditions. Strength of exPe™ 


Expectancy session constituted the first replication of the experiment. - 
other five replications of the experiment were constituted in a similar fashi 
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2. Subjects 


The Ss were 48 University of Michigan coeds. Participation in research was 
a course requirement. The Ss were run in groups of four, The research was 
described as a study of the relationship between reaction time (RT) and per- 
sonality. First their RT was to be measured and then they were told that they 
were cach to go to a different room for a personality interview. Actually the 
interviews never took place. They were mentioned in the instructions to 
heighten involvement and to maintain procedural similarity with previous 
studies. 

3. Procedure 


Several procedures were used to control within-session communication 
and the anticipation of postexperimental score comparisons. The incentive 
value of a high score on the test would probably be affected by the anticipation 
of comparing scores with other Ss after the session. The likelihood of such 
meetings would be minimized if one knew none of the other Ss and perceived 
that there would be no opportunity to meet with them immediately after the 
session. Pre-experimental acquaintance was minimized by requiring that each 
S in a session come from a different course section and live in a different dormi- 
tory. Anticipated postexperimental interaction was minimized by telling the Ss 
that their interviews would be conducted simultaneously in different rooms and 
would last different lengths of time. The Ss were instructed not to talk to one 
another. 

'The group reaction time apparatus (4, 8, 9) was used. The Ss were seated 
at a large table which was divided into separate work areas by wooden par- 
titions. Each $ had a console which contained a stimulus light and a response 
switch. The S’s task was to pull the switch as soon as possible after the onset 
of the stimulus. A two-second warning tone preceded stimulus onset. Each $'s 
reaction time was measured by a Standard 5-1 timer. A RT of less than 200 
milliseconds was described to the Ss as a successful performance and their 
score was said to be the number of successes they achieved. They were told that 
past performances ranged from 20 per cent success to 90 per cent success. 
The Ss were also told that it seemed to be quite difficult for people to evaluate 
their performance on the RT test; that some people who think they do very 
poorly are among the best performers, and vice versa. 

Before they began the task the Ss were given 20 practice trials without 
feedback. The task itself was divided into blocks of 10 trials. Within a block, 
the intertrial interval was 20 seconds. The interblock interval was three 
minutes. 
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After each block E communicated by written note to each $ the number of 
successes she had achieved on the previous 10 trials. This feedback bore ng 
relationship to the S’s actual performance and was manipulated to produce 
variations in the experimental variables. 

Half the Ss were led to expect they were very proficient; their feedback 
indicated 85 per cent success on the first few trial blocks. This was the High” 
Expectancy condition. The other half of the Ss, the Low Expectancy condition, 
were led to expect they were not proficient; their feedback indicated only 25 
per cent success on the first few trial blocks. 

Expectancy strength was varied by manipulating the number of 10-trial 
blocks (either two or four) in the expectancy formation phase. Strong expe 
tancies were created for half the Ss, weak expectancies for the other half. 
In the Strong-High Expectancy condition the initial feedbacks were nine 
eight, eight, and nine correct. The initial feedbacks in the Strong-Low Expec- 
tancy conditions were three, two, two, and three. In the Weak-High Exe 
pectancy condition the initial feedbacks were nine and eight. In the Weak-Low 
Expectancy condition the initial feedbacks were three and two. 

For half the Ss, feedback on the final trial block indicated nine successesy 
the High Performance condition. For the other half the Ss, final block feed- 
back indicated three successes, the Low Performance condition. (The final 
trial block was the fifth trial block for Strong Expectancy Ss and the third t ial 
block for Weak Expectancy Ss.) Feedback on the final block confirmed the 
expectancies of half the Ss and disconfirmed the expectancies of the other half. 

After the Ss had learned their score on the final block, E claimed he would 
have to throw that score out and have the Ss do the final block again. The 
justification for this ruse was as follows. After the Ss had completed the final 
section, E claimed he had discovered an equipment failure. He said that he 
had been measuring the force with which the Ss pulled the switch, as well as 
their reaction time. The amplitude measuring apparatus had not worked on 
the last trial block, E claimed. He asked the Ss to score their successes on ШШ 
final block. The score booklets had been prepared in advance, and the Ss 
discovered they had achieved either nine or three successes. This score eith 
confirmed or disconfirmed their expectancy. After they had finished scoring, Ё 
announced he had completed repairs and that, since the amplitude data were 
so important to the experiment, he would have to throw out the final block 
and have them do another 10 trials in its place. In that way he could measure 
both their response latency and amplitude on the final section. The E then 
collected the score booklets and, without looking at the score on the final block, | 
destroyed what were apparently the records of their final block performante. 
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Next the Ss did another 10 trials and then, without feedback on their replace- 


ment block performance, filled out a questionnaire. 


4. Dependent Variables 


Adjustments of task proficiency were determined by subtracting the mean 
RT on the replacement trial block from the mean RT on the discarded trial 
block. Thus, a positive change score indicates that proficiency improved (RT 
was shorter) on the replacement block. Although feedback was false, the actual 
RTs had been recorded. 

The $$” estimates of their proficiency at the task were assessed by a ques- 
tionnaire item, “Suppose you attempted 10 more trials on the reaction time 
test, how many times do you think you would perform better than the standard 
(react faster than 200 milliseconds) ? Do not check the number of times you 
would like to excel the standard, nor the number you would try to excel the 
standard, but, rather, check the number of times you would probably excel 
the standard." "The Ss checked one of 11 responses which ranged from “zero” 
to “10.” This item is very similar to the one used by Cottrell (5). 

Another item was: “How tense or relaxed do you feel now?” The Ss 
responded on a 61 point, a priori scale which ranged from “Rather Tense” 
(—3.0) to “Rather Relaxed” (3.0). 

Additional items determined the degree of pre-experimental acquaintance 
among Ss in a session, whether they had seen another’s scores on the task, and 
if they felt others had seen their scores. 


C. ResuLTS 


Presession and within-session communication among Ss was slight and 
homogeneous among the experimental conditions. There was no talking among 
the Ss, Four of the 48 Ss reported a slight degree of prior acquaintance with 
one other $ in their session. Three Ss reported that they knew the scores of 
another girl in their session and three Ss reported they felt one of the other 
girls knew their scores. Fisher exact tests indicated that none of the commu- 
nication measures was significantly related to any of the main effects, inter- 
actions, or simple effects of the experimental variables, 


1. Estimates of Future Success 


Column 1 of Table 1 presents the mean estimates of the number of successes 
that would be achieved on a new set of 10 trials. There are several significant 
effects. The estimates of High Expectancy subjects are significantly (6 < .001) 
larger, X — 7.25, than those of the Low Expectancy subjects, X = 3.79, 
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F (1, 30) = 143.57. This finding validates the experimental manipulation of 
level of Expectancy. Subjects in the High Expectancy condition did expect 
that they would perform more proficiently than did subjects in the Low Expee- 
tancy condition. The Expectancy Strength X Expectancy effect is significant 
(p < .005), F (1, 30) = 11.02, as well. The interaction is of the form that 
variation in Expectancy has a greater effect upon the success estimates in the 
Strong Expectancy condition than in the Weak Expectancy condition. 


TABLE 1 
Means or NUMBER or Successes EXPECTED, CHANGES IN RT TASK PROFICIENCY, 
AND RATINGS OF TENSION FOR THE EIGHT EXPERIMENTAL CONDITIONS 
(Each mean is based on six cases) 


Success Tension 
Experimental condition estimates RT change” ratings® 
Weak Expectancy 
Expectancy Performancea 
Low Low 3.17 5.83 EL 
Low High 6.00 —10.50 — 4$ 
High w 5.83 — 6.83 — 0 
High High 8.33 1.17 65 
Strong Expectancy 
Expectancy Performance 
Low Low 2.83 —10.83 45 
Low High 3.17 4.17 8 
High Low 6.50 7.17 -3$ 
High High 8.33 2.17 3% 


а Performance is level of final feedback. 

b Changes are in milliseconds. Negative change indicates replacement trial block 
RT slower than discarded trial block RT. 

© Negative score indicates report of feeling tense. 


Of greatest relevance to dissonance reduction are the findings concerning 
the Performance variable (level of final feedback). If persons reduce disso- 
nance by rejecting a discrepant feedback, then persons with a final feed 
that was “too high” should not differ in their success estimates from persons 
who consistently received low feedbacks. By the same token, persons with à 
final feedback that was “too low” should not differ in their success estimates 
from persons who consistently received high feedbacks. The rejection of im 
consistent information would reduce the effectiveness of the Performance 
variable and, since dissonance is greatest in the Strong Expectancy condition, 
the Performance variable should be least effective in that condition. 

‘The analysis of variance of the data in column 1 of Table 1 revealed that 
the Performance effect is significant (р < .001), F (1, 30) = 42.19. 
estimates of the High Performance subjects are larger, Ж — 6.46, than 
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estimates of the Low Performance subjects, X = 4,58, The interaction be- 
tween Expectancy strength and Performance is significant (> < .02), F (1, 
30) = 7.52, as well. This interaction is of the form that only in the Weak 
Expectancy condition did a discrepant final feedback influence the success 
estimates. Inspection of rows 1 through 4 of column 1 of Table 1 shows that 
in the Weak Expectancy condition the estimates of the dissonance subjects 
differed greatly from the estimates of the consistent controls, The average 
estimate in the Low Expectancy-Low Performance condition was 3.17 and 
in the Low Expectancy-High Performance condition the average estimate 
was 6.00. In the High Expectancy-High Performance condition the average 
estimate was 8.33 and in the High Expectancy-Low Performance condition 
the average estimate was 5.83. When we combine both of these differences, the 
average estimate in the Weak Expectancy-High Performance condition is 
7.16 and in the Weak Expectancy-Low Performance condition 4.50. The dif- 
ference between these means is significant (p < .001), t = 3.701, df = 22. 
Thus, in the Weak Expectancy condition, when little dissonance was aroused 
by a discrepant feedback, the subjects were very responsive to a discrepant 
feedback when estimating their task proficiency, 

Inspection of rows 5 through 8 in column 1 of Table 1 reveals that in the 
Strong Expectancy condition the estimates of dissonance subjects differed little 
from the estimates of the consistent controls. The average estimate in the Low 
Expectancy-Low Performance condition was 2.83 and in the Low Expectancy- 
High Performance condition the average estimate was 3.17. In the High 
Expectancy-High Performance condition the average estimate was 8.33 and 
in the High Expectancy-Low Performance condition the average estimate 
was 6.50. When we combine both these differences the average estimate for 
the Strong Expectancy-High Performance condition was 5.75 and the Strong 
Expectancy-Low Performance condition 4.67. This difference was not signifi- 
cant by a t-test. Thus, in the Strong Expectancy condition, when a relatively 
large magnitude of dissonance was aroused by a discrepant final feedback, the 
subjects rejected the discrepant final feedback when estimating their task 
- proficiency. 

The findings in the Strong Expectancy condition are almost identical to 
of an carlier study (5). 


2. Task Proficiency 


É The changes in task proficiency are shown in column 2 of Table 1. If per- 
adjust their task proficiency in order to reduce dissonance, then those 
a final feedback that was “тоо high” should reduce their task proficiency 
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relative to the consistent controls and those with a final feedback that was 
“too low” should increase their task proficiency relative to the consistent con- 
trols. These changes would result in a significant effect of Performance upon 
the RT change scores. Such was not the case. The analysis of variance revealed 
that neither the Performance effect nor the Expectancy Strength X Perfor- 
mance effect was significant. There was no evidence that persons adjust their 
task proficiency in order to reduce dissonance between expected and obtained 
performance. 

There was some indication that task proficiency changed as a joint function 
of disconfirmation and Expectancy Strength. The three-factor interaction 
approaches statistical significance (р < .08), F (1, 30) = 3.48. The inter- 
action has the form that the stronger the disconfirmed expectancy, the more 
task proficiency improved relative to the consistent controls. Because of its 
borderline statistical significance, this effect will not be discussed further. 


3. Tension 


Column 3 of Table 1 presents the mean ratings of experienced tension. 
Only the Expectancy X Performance effect is significant (p < .025), F (1, 
30) = 6.84. The interaction has the form that Ss in dissonance conditions 
(High Expectancy-Low Performance and Low Expectancy-High Perfor- 
mance) reported that they felt more tense, Х = —.70, than the Ss in the 
consistent control conditions (High Expectancy-High Performance and Low 


Expectancy-Low Performance), X = .60. 


D. Discussion 


In the present study the effects of expectancy-performance dissonance were 
found to be confined to judgmental processes. Task proficiency was not affected. 
When a relatively large magnitude of dissonance was aroused, persons Te- 
jected inconsistent information when estimating their proficiency at the task. 
But, regardless of the magnitude of dissonance, persons did not adjust their 
task proficiency in order to reduce dissonance between expected and obtained 
performance. 

Perhaps the subjects failed to adjust their task proficiency because the 
experimental manipulations of the present study failed to arouse cognitive 
dissonance. There is no support for this interpretation. Dissonance-reduction 
effects were obtained on the judgmental measure, estimates of future success- 
Also, subjects in the dissonance conditions (High Expectancy-Low Perfor- 
mance and Low Expectancy-High Performance) reported that they felt tense 
relative to the consistent controls (Low Expectancy-Low Performance and 
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High Expectancy-High Performance). Apparently, the manipulations of the 
present study produced the psychological discomfort that characterizes cogni- 
tive dissonance (6, p. 3). 

Lecky’s (7) theory of self-consistency has important implications for educa- 
tion and training. In terms of self-consistency theory, in order to produce а 
change in performance level, an appropriate change in the expected level of 
performance must be effected first. Otherwise, if a person performs at an un- 
accustomed level of proficiency, he will then reject the new level by changing 
his performance so as to reinstate the disconfirmed expectancy. The results 
of the recent studies (1, 3, 5) might appear to confirm Lecky’s analysis. How- 
ever, these studies did not use a performance task. The dependent variables 
were judgmental responses. The present study is the only one of the recent 
set that allowed the subjects an opportunity to reduce dissonance by adjust- 
ing their actual task proficiency. The results of the present study indicate 
that the performance-expectancy effect is confined to judgmental processes. 
Dissonance between expected and obtained performance affects judgments, 
but not task proficiency. Lecky’s analysis still lacks experimental confirmation. 


E. SUMMARY 


The objective of this study was to determine whether persons adjust their 
actual task proficiency, as well as change their judgments, in order to reduce 
dissonance between expected and obtained performance. Forty-eight females 
were led to expect good or poor performance on an RT task. The strength 
of the expectancy was varied also. False feedback cither confirmed or discon- 
firmed the expectancy. Ss in the dissonance conditions reported they felt tense 
and Ss with strong expectancies reduced dissonance by adjusting their estimates 
of future success. However, Ss did not adjust their task proficiency in order 
to reduce dissonance. A discussion of these and other findings suggested that 
the performance-expectancy effect might be limited to judgmental processes. 
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A. INTRODUCTION 


Past studies of the college dropout have generally dealt with student charac- 
teristics, with a few studies of institutional characteristics (5, 12). Largely 
neglected has been the area of student-environment interaction, though there 
is evidence that the analysis of such interaction is crucial to our understanding 
of all phases of the learning process (1, 4, 6, 7, 10). 

Beyond some few common characteristics, colleges are vastly different from 
one another (11). Furthermore, there is evidence that students generally 
select a college whose image fits their own needs, providing for a match 
between student and college environment (9). In many cases, however, stu- 
dents are forced to choose on some other basis, have a distorted image of their 
preferred college or have an unrealistic image of their own needs. Each of 
these means a lack of fit between the needs of the individual and the press 
or sources of reward and frustration in the college environment. Stern (11) 
reports that at the University of Chicago a minority group of authoritarians 
contributed most heavily to the withdrawal rate. Similarly, Funkenstein (3) 
has tied the issue of fit between student needs and college attributes to drop- 
outs from medical schools. 


A lack of fit between student and college characteristics should also lead 
to some kinds of dissatisfaction with the college experience, which ultimately 
may lead to dropping out of college. A study by Davie (2) gives some indirect 
evidence that this is in fact the case. The research reported in this paper inves- 
tigated how discrepancies between the student’s perception of himself and his 
college relate to dissatisfaction with college and probability of dropping out. 
It was hypothesized that the greater the discrepancy between the way a student 
sees himself and his image of the college, the more he will be dissatisfied with 


* Received in the Editorial Office, Provincetown, Massachusetts, on June 10, 1966. 
Copyright, 1967, by The Journal Press. 

1 This study was supported by Grant MH08321 from the National Institute of 
Mental Health to the senior author and Grants GE4039 and GE6289 of the NSF 
Undergraduate Research Participation Program to the junior author. 

2 Now at Harvard University. 


285 


286 JOURNAL OF SOCIAL PSYCHOLOGY 


college and consider dropping out. Furthermore, since self-college discrepancies 
should be tied more to nonacademic than academic affairs, it was predicted 
that the above relationship would hold more for nonacademic than for academic 
dissatisfaction, and more for dropping out for nonacademic (personal) reasons 
than for academic reasons. 


B. METHOD 
1. Subjects 


Subjects were 50 Princeton University upperclassmen who were paid 
participants in a study of cognitive style and physiological functioning. The 
data relevant to the current study were obtained individually from the sub- 
jects two weeks after the completion of the other study. 


2. Materials and Procedure 


A questionnaire was used to assess the students’ perceptions of their own 
personality characteristics, of the characteristics associated with Princeton 
College and different parts of the college. The Instrument for the Trans- 
actional Analysis of Personality and Environment (ITAPE) was developed 
for use in this study and has now been administered at other colleges through- 
out the country. The ITAPE is based on the semantic differential techniques 
of Osgood (8) and asks students to rate a number of concepts on the same 
set of polar adjective scales. In its standard form the ITAPE requires that 
the following concepts be judged on 60 polar adjective scales: College, Self, 
Administration, Faculty, Students. Concepts relevant to a college refer to the 
college the student is attending. Examples of the polar adjective scales are 
authoritarian-democratic, grinding-fun-loving, religious-secular, equalitarian- 
status-oriented. Two forms (A and B) of the test are under development? 

A slight revision in the ITAPE was made for the purposes of this investi- 
gation. The concepts used in this study were College, Self, Students, and Ideal 
College. For each concept an explicit description of what was meant by the 
concept was included to avoid any possible ambiguity. Subjects took Form A 
of the test on one day and Form B of the test a couple of days later. Since 
each form included 60 scales, every concept was rated by each subject on 4 
total of 120 scales. At the end of Form A the subjects answered 16 questions 
that covered the tendency to drop out of college, satisfaction with college, 
and frustrations at college. Questions used to investigate the student's reported 
probability of dropping out were the following: 1. How likely is it that you 


5 Copies of the ITAPE may be obtained from the senior author. 
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will at some time drop out of college? 2. How likely is it that you will at 
some time drop out for academic reasons (poor grades)? 3. How likely is 
it that you will at some time drop out for nonacademic reasons (personal 
reasons, transfer, etc.) ? 4. How often do you think about dropping out of 
college? Questions dealing with dissatisfaction with college were 1. All in 
all, in terms of your own needs and desires, how satisfied are you with the 
academic aspects of your college? 2. All in all, in terms of your own necds 
and desires, how satisfied are you with the nonacademic aspects of your college? 
3. So far, what kinds of times have you had at your college? Answers to 
these questions were given on a 10-point scale. These answers were used as 
measures of the tendency to drop out of college and dissatisfaction with 
college.‘ 

An additional part of this study was the investigation of the relationship 
between attitude towards dropping out and probability of dropping out. At 
the end of Form B subjects rated the concept Dropping Out on the follow- 
ing scales: good-bad, foolish-wise, successful-unsuccessful, worthless-valuable, 
safe-dangerous, pleasant-unpleasant, cowardly-brave, honest-dishonest, sad- 
happy, weak-strong. 

C. RESULTS 


The hypotheses in this experiment stated that discrepancies between per- 
ceptions of the self and perceptions of the college would be related to the re- 
ported probability of dropping out and dissatisfaction with the college, and 
that these relationships would be stronger for nonacademic variables than for 
academic variables. Since the concepts were being judged on the same scales, 
a discrepancy score between two concepts was calculated by summing the 
absolute value of the difference in ratings between the two concepts. For 
each pair of concepts three discrepancy scores were calculated: the sum of 
the differences in ratings on the 60 scales in Form A, the sum of the differ- 
ences in ratings on the 60 scales in Form B, and, finally, the sum of the 
differences in ratings on the 120 scales. Discrepancy scores were calculated 
between the following pairs of concepts: Self-College, Self-Students, College- 
Ideal College. 


The product-moment correlations between the discrepancy scores and re- 


* Many of the satisfaction items were based on a questionnaire developed by James 
Davie at the Yale University Health Services. While reports of a tendency toward 
opping out are not necessarily related to actual dropout behavior, they may be 
taken as signs of stress and tendencies toward leaving the field. Actual dropout 
avior is determined by so many factors (the draft, available jobs, parental atti- 
tudes, etc.) that reports of thinking about and a tendency toward dropping out may 
more relevant to the issues raised in this study. 


TABLE 1 


CORRELATIONS BETWEEN DISCREPANCY Scores AND DROPOUT VARIABLES 


ITAPE Any 
Concept form dropout 
Self- Form A 21 
College Form B 18 
A+B ai 
Self- Form A at 
Students Form B 18 
A+B 25 
College- Form A .33* 
Ideal Form B .20 


College A+B Age 


* 5 < .05 two-tailed test. 
** p < .01 two-tailed test. 
##* 5 < .001 two-tailed test. 
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sponses to the dropout questions are given in Table 1. Responses to the drop- 
out questions were keyed so that a high score always indicated a high prob- 
ability of dropping out. These data indicate that a perceived discrepancy 
between Self and College is not significantly related to reported probability 
of dropping out for academic reasons but is significantly related to reported 
probability of dropping out for nonacademic reasons. In all cases the dis- 
crepancies are more clearly related to the nonacademic variables than to the 
academic variables. The correlations for Forms A and B are generally quite 
similar, 

The correlations between the discrepancy scores and responses to the dis- 
satisfaction questions are given in Table 2. Responses to the dissatisfaction 
questions were keyed so that a high score always indicated great dissatisfaction. 
While there is some evidence of a relationship between perceived discrepancies 
and academic dissatisfaction, the relationship of the discrepancies to non- 
academic dissatisfaction is far more impressive. The correlations for Forms A 
and B are generally quite similar. 

Data from the semantic differential with the concept Dropping Out were 
analyzed by keying all scales so that a high score indicated a favorable attitude 
toward dropping out. Scores on the 10 scales were summed to give a total 
score for attitude towards dropping out. The correlations between dropout 
attitude scores and dropout variables are presented in Table 3. Once more 
the relationship for academic dropout is low compared to that for nonacademic 
dropout. 

Since the data indicated some independence of dropout attitude from per- 
ceived discrepancies, these two sets of data were combined and related to the 
dropout variables. Multiple correlation coefficients for dropout attitude and 
perceived discrepancies (sum of Forms A and B) related to the dropout 
variables are presented in Table 4. The relationships continue to be much 
higher for nonacademic variables than for academic variables. Dropout atti- 
tude and perceived discrepancy together account for 30 to 45 per cent of the 
variance in reported probability of dropping out for nonacademic reasons. 


D. Discussion 


The data in this study have supported the view that discrepancies between 
perceptions of the self and the college, the self and students, and the college 
and ideal college are significantly related to the reported probability of 
dropping out for nonacademic reasons and to nonacademic dissatisfaction with 
college. Furthermore, these data clearly indicate the utility of distinguishing 
between academic and nonacademic dropouts and sources of frustration. For 


TABLE 2 


CORRELATIONS BETWEEN DISCREPANCY SCORES AND DISSATISFACTION VARIABLES 


ITAPE Academic Nonacademic Kind of Nonacademic + 

Concept form dissatisfaction dissatisfaction times kind of times 
Self- Form A .26 53^ .39** 53990 
College Form B .28* «54—09 53% $1909 
A+B .26 ry ped 49% 361929 
Self- Form A .12 5009% „50% A T ene 
Students Form B .26 rece 57 61988 
A+B 22 Ageee 52%%® 56677 
College- Form A ы 67066 A79 .66*** 
Ideal Form B .26 .Sg** agers .60*** 
College A+B КУБЫ „Занев AS 60%** 


* p < .05 two-tailed test. 
** p < .01 two-tailed test. 
*** 5 < .001 two-tailed test. 
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TABLE 4 
MULTIPLE CORRELATION COEFFICIENTS RELATING DROPOUT ATTITUDE AND 
СомсЕРТ Discrepancies (A + B) то DROPOUT VARIABLES 


Any Academic Nonacademic Thinking Think + 
Concept dropout dropout dropout dropout nonacademic 
Dropout attitude + 
Self-College 42** .30* ЧК fecha soe 64*** 
Self-Students E dd 12 .56"@6 .5ge»e 26778 
College-Ideal College PINE 12 6498 56*** ise? 


* p < .05 two-tailed test. 
** 5 < .01 two-tailed test. 
*** 5 < .001 two-tailed test. 
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this population, the reported probabilities of dropping out for academic and 
nonacademic reasons correlated .18, and academic satisfaction correlated .32 
with nonacademic satisfaction. 

Finally, the data indicate that a favorable attitude toward dropping out 
is significantly related to the perceived probability of dropping out. At this 
time, however, we do not know whether dropping out is perceived more 
favorably because it is perceived as probable, or it is perceived as more probable 
because it is perceived favorably, or whether both may occur. While this study 
has dealt with reported probability of dropping out, another study will have 
to determine the relationship between such reports and actual behavior; that 
is, do students who report a high probability of dropping out actually do so? 

The present study indicated a relationship between the perceived dis- 
crepancies and the dropout and satisfaction variables. In understanding such 
relationships, it is valuable to know the scales on which discrepancies between 
the concepts were most likely to occur and which discrepancies were most 
significantly related to the dropout and satisfaction variables. Relevant data 
were obtained from the mean ratings on the scales for the concepts for high 
and low concept discrepancy groups and high and low nonacademic dissatis- 
faction groups. These data indicated that the main source of variance was 
in the way students perceived the college, with some variance being related 
to perceptions of the self, students, and the ideal college. Compared to the 
low perceived discrepancy and low dissatisfaction group, those in the high 
discrepancy and high dissatisfaction group rated the college as higher on the 
following items: dull, bored, uncreative, competitive, tense, ritualistic, con- 
ventional, conservative, Republican, status-oriented, militaristic, repressive, 
conforming, coercive, detached, snobbish, cold, callous, unfriendly, intolerant. 
Some of these items appear to reflect an evaluative dimension, while others 
reflect social and political values and means of impulse expression. 

While the findings reported here support both the individual-environment 
interaction approach and the utility of the ГГАРЕ, further research is neces- 
sary to define further the relevant environmental and personality variables. 
Longitudinal studies and comparative studies of different colleges are needed. 
Also, the possibility that different personalities deal with discrepancies between 
themselves and their environment in different ways should be investigated. 
Finally, while similar to Stern's Activities Index and College Characteristics 
Index, and to Pace’s College and University Environment Scales, the ITAPE 
has some unique properties and appears to warrant further development. It 
shows promise of picking up differences between schools on single concepts 
and between schools in the relationships of the College, Self, Administration, 
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Faculty, and Student ratings. In sum, it shows promise of developing into 
a useful technique for investigating the similarity of meaning of concepts 
relevant to the educational environment. The time may come when we will 
be able to indicate to the college-bound student at which college he is likely 
to mature and find personal satisfaction, and at which colleges he is likely 
to be traumatized due to conflicts between his personality and the charac- 
teristics of the college environment. 


E. SUMMARY 


This study predicted that discrepancies between the student’s perception of 
himself and his college environment would be related to his tendency toward 
dropping out of college and his dissatisfaction with college. It was predicted 
that this relationship would hold more for dropping out for nonacademic than 
for academic reasons, and more for nonacademic dissatisfaction than for 
academic dissatisfaction. Fifty Princeton upperclassmen rated the concepts 
Self, College, Students, and Ideal College. These concepts were part of the 
Assessment of Personality and Environment Test based on the semantic differ- 
ential technique. Discrepancies between Self and College, Self and Students, 
and College and Ideal College ratings were significantly related to reported 
probability of dropping out for nonacademic reasons and to nonacademic dis- 
satisfaction with college. The data supported the above predictions. Attitude 
toward dropping out was also found to relate significantly to reported prob- 
ability of dropping out. The significance of these findings for the study of 
student-college interaction and potential uses of the Assessment of Personality 
and Environment Test were discussed, 
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CROSS-CULTURAL NOTES 


Under this heading appear summaries of studies which, in 500 words or 
less, provide comparable data from two or more societies through the use of 
a standard measuring instrument; additional details concerning the results 
can be obtained by communicating directly with the investigator. 


The Journal of Social Psychology, 1967, 72, 297-298. 


VALUES OF STUDENTS IN FOUR PROFESSIONS UNDER 
INDIAN CONDITIONS* 


Department of Education, University of Allahabad, India 


5. К. Pau 


The Allport-Vernon Lindzey Values Test was administered to groups of 
50 engineering, law, medical, and teacher-training students at the University 
of Allahabad; all were male and in their fourth or final year. 

In a comparison of engineering and law students, engineering students 
yielded a significantly higher value on the Economic Scale, while law students 
revealed a significantly higher value on the Political Scale. When engineering 
and medical students were compared, engineering students exhibited signifi- 
cantly higher values on the Economic and Aesthetic Scales, while medical 
students yielded significantly higher values on the Theoretical and Social 
Scales. Engineering students were found to have significantly higher scores 
than teacher-training students on the Economic value, while teacher-training 
students showed significantly higher score on the Political value. 

The results of the application of ¢-tests to the differences yielded by law 
and medical students on the Study of Values showed that law students scored 
significantly higher than medical students on the Aesthetic and Political 
values, while medical students scored significantly higher than law students 
on the Theoretical and Social Scales. No significant value differences were 
found between law and teacher-training students on the six value variables. 
In a comparison of medical and teacher-training students, medical students 
revealed a significantly higher value on the Social Scale, while teacher-training 
students indicated significantly higher values on Aesthetic and Political Scales. 

Findings in general lead to the conclusion that differences in values do 


® Received in the Editorial Office, Provincetown, Massachusetts, on February 20, 
1967, and given special consideration in accordance with our policy for cross-cultural 
research, Copyright, 1967, by The Journal Press. 
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exist between students preparing for the professions of enginecring, law, 
medicine, and teaching. The question whether students develop these values 
with training, or whether students who have such values choose to enter these 
professions, cannot be answered by means of this study alone. 


Department of Education 
University of Allahabad 
Allahabad, India 


The Journal of Social Psychology, 1967, 72, 299-300. 


A CROSS-VALIDATION OF THE MARLOWE-CROWNE 
SOCIAL DESIRABILITY SCALE ON AN INDIAN SAMPLE* 


Nagpur University, Nagpur, India 


Bisuwa NATH MUKHERJEE 


Goldfried? has recently criticized the Marlowe-Crowne Social Desirability 
Scale (M-C SD Scale) on the ground that almost half of its items “failed 
to discriminate beyond chance when cross-validated.” The purpose of the 
present study was to check the above observation and to determine the extent 
of sex difference on each item of the M-C SD Scale. The Ss were 50 male and 
50 female students, with a mean age of about 21 years, enrolled in various 
graduate departments of Nagpur University. A very slightly modified form 
of the M-C SD Scale was given to the Ss with standard instructions, The 
25 high scorers and 25 low scorers on the total M-C SD Scale formed the 
two criterion groups for each sex. 

Out of 33 items, 29 showed satisfactory internal consistency as indicated 
by x2. Nine of these items were ones also found by Goldfried to discriminate 
for the male group at the .05 level or better. In the case of females, only 
five items were found to be discriminating in both studies. Significant sex 
differences were observed in the present study with regard only to seven items, 
most of which were different from the significant ones in Goldfried’s study. 
‘The mean scores on the total scale for males and females were 20.1 and 21.1, 
respectively. The standard deviations were 4.78 and 3.89. The critical ratio 
turned out to be nonsignificant. 

Although the M-C SD Scale as a whole failed to show any sex difference, 
a few items of it yielded significant results. Thus, while Goldfried found no 
sex difference on Item 3, the present study revealed that females tend to 
endorse the item more frequently than males, Such differences can be inter- 
preted in terms of the way the traits represented by a particular item are 
perceived as socially approved or disapproved. The fact that, with two excep- 
tions, none of the items showing sex differences in Goldfried’s study yielded 
the same results in this investigation may be also accounted for in terms of 


* Received in the Editorial Office, on March 3, 1967, and given special consideration 
in accordance with our policy for cross-cultural research. Copyright, 1967, by The 
Journal Press. 

1 Goldfried, M. R. A cross-validation of the Marlowe-Crowne SD Scale items. 
J. Soc. Psychol., 1964, 64, 137-145. 


299 


300 JOURNAL OF SOCIAL PSYCHOLOGY 


differential cultural expectations. There are a few items that failed to show 
sex difference in Goldfried’s study but that are revealing in the present one, 
a finding that may be explained in terms of the differential standards set for 
men and women in India. 

Since the present study shows that most of the items of the M-C SD Scale 
are internally consistent, it must be concluded that the main result of 
Goldfried is not cross-validated. In addition to showing the importance of 
cross-validating item-analysis results, the findings reported here suggest how 
item-analysis can be used in cross-cultural studies. 


Department of Psychology 

Nagpur University 

Gandhi Bhavan, North Ambazari Road 
Nagpur-1, M.S., India 
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ATTITUDES OF FILIPINO AND AMERICAN COLLEGE 
STUDENTS ASSESSED WITH THE SEMANTIC 
DIFFERENTIAL* * 


Naval Aerospace Medical Institute, Pensacola, Florida; and Department 
of Psychology, New York University 


ALLEN J. SCHUH AND CARMENCITA C. QUESADA 


The present paper reports on the use of the semantic differential in a study 
of the comparative attitudes of Filipino and American college seniors toward 
certain personal and educational concepts. Because of the similarity in lan- 
guage, educational system, religion, system of government, age, and educa- 
tional level of the Subjects (Ss), the measurement of the meaning of the 
concepts could be conducted and attributed to cultural background while 
many variables that usually confound research studies of this nature are 
held constant. 

The American Ss were 153 males and 135 females enrolled at the Univer- 
sity of California, Berkeley, while the Filipino Ss were 36 males and 175 
females attending the University of the East, Manila. Ten bipolar adjective 
scales (good-bad, active-passive, fast-slow, strong-weak, soft-hard, aggressive- 
defensive, rash-cautious, sensitive-insensitive, sincere-insincere, boring-interest- 
ing) were used to rate four concepts (Myself, The Way I Would Most 
Want To Be, My Role as a Student, My Faculty Advisor). 

The data were combined according to sex of the Ss, and Pearson product- 
moment correlation coefficients were computed between the scales and the 
criterion of American versus Filipino cultural group. The significance of the 
deviation of the correlation coefficients from zero was assessed with a two- 
tailed £ test. For the female group, 27 of the 40 scales exceeded the .05 level 
of significance, while 17 of the 40 male scale comparisons exceeded the .05 
level. Only adjective scales validating across both male and female groupings 
are reported. 

The groups were judged as equivalent on the self-concept with the excep- 
tion that the American students saw themselves as more cautious than their 


* Received in the Editorial Office, on March 7, 1967, and given special consideration 
in accordance with our policy for cross-cultural research. Copyright, 1967, by ‘The 
Journal Press. 

1 The senior author is on active duty in the United States Navy. Opinions or con- 
clusions contained in this report are those of the authors. They are not to be construed 
as necessarily reflecting the views or the endorsement of the Navy Department. 
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Filipino counterparts. On the ideal-self measure, the American students 
described themselves as aspiring to be'more hard, aggressive, sincere, and 
interesting than their Filipino counterparts. The Filipino students more 
favorably described the education-related concepts than did the American 
students. The Filipinos described their faculty advisors as more good, active, 
fast, strong, soft, and sincere, and their roles as students as more good, active, 
sensitive, and sincere than their American counterparts. 

These preliminary findings lead to the following three conclusions: (a) 
these particular students in the Republic of the Philippines and in the United 
States of America see their educational environments as noticeably different; 
(b) the differences between the female samples are more numerous than 
between the male samples; and (c) Filipino and American college seniors 
have extremely similar evaluations of themselves. 


Allen J. Schuh 

Naval Aerospace Medical Institute 
Naval Aerospace Medical Center 
Pensacola, Florida 32512 
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